RoboBrain 2.5 teaches robots to see depth precisely and to keep track of time-aware progress, so plans turn into safe, accurate actions.
This paper teaches a vision-language model to first find objects in real 3D space (not just 2D pictures) and then reason about where things are.