LingBot-VLA is a robot brain that listens to language, looks at the world, and decides smooth actions to get tasks done.
This paper builds a foundation model called DAP that estimates real-world (metric) depth from any 360° panorama, indoors or outdoors.