IVRA is a simple, training-free add-on that helps robot brains keep the 2D shape of pictures while following language instructions.
Robots usually learn by copying many demonstrations, which is expensive and makes them brittle when things change.
Robots often see the world as flat pictures but must move in a 3D world, which makes accurate actions hard.