Robots learn better when they predict short, meaningful summaries of future images instead of drawing every pixel of the future scene.
Fast-ThinkAct teaches a robot to plan with a few tiny hidden "thought tokens" instead of long paragraphs, making it much faster while staying smart.