Green-VLA: Staged Vision-Language-Action Model for Generalist Robots
IntermediateI. Apanasevich, M. Artemyev et al.Jan 31arXiv
Green-VLA is a step-by-step training recipe that teaches one model to see, understand language, and move many kinds of robots safely and efficiently.
#Vision-Language-Action#Unified Action Space#Multi-embodiment Pretraining