Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Intermediate · Chi-Pin Huang, Yunze Man et al. · Jan 14 · arXiv
Fast-ThinkAct teaches a robot to plan with a few compact latent "thought tokens" instead of long written reasoning, making it much faster without giving up its reasoning ability.
#Vision-Language-Action #latent reasoning #verbalizable planning