VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory
Intermediate · Shaoan Wang, Yuanfei Luo et al. · Jan 13 · arXiv
VLingNav is a robot navigation system that sees, reads instructions, and acts, while deciding when to think hard and when to just move.
#Vision-Language-Action #embodied navigation #adaptive chain-of-thought