Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
Intermediate | Juil Koo, Daehyeon Choi et al. | Dec 15 | arXiv
This paper teaches robots to move their camera to a better spot before answering a question about what they see.
#Active Perception #Embodied AI #Vision-Language Models