Cosmos Policy teaches robots to act by fine-tuning a powerful video model in just one training stage, without changing the model's architecture.
Dream2Flow lets a robot watch a short, AI-generated video of a task and then do that task in real life by following object motion in 3D.
SurgWorld teaches surgical robots using videos plus text, then guesses the missing robot moves so we can train good policies without collecting tons of real robot-action data.