Robots learn better when they predict short, meaningful summaries of future images instead of drawing every pixel of the future scene.
FOFPred is a new AI that reads one or two images plus a short instruction like "move the bottle left to right," and then predicts how every pixel will move in the next moments.
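To make "how every pixel will move" concrete, here is a toy sketch of a per-pixel motion field, the kind of data often called optical flow. It is not FOFPred's code: the function names `make_uniform_flow` and `move_points` are made up for illustration, and the flow here is filled in by hand rather than predicted by a model. The assumption is simply that each pixel gets two numbers, a shift in x and a shift in y.

```python
import numpy as np

def make_uniform_flow(height, width, dx, dy):
    """Build a dense motion field of shape (height, width, 2).

    Channel 0 holds each pixel's horizontal shift, channel 1 its vertical shift.
    Here every pixel moves by the same (dx, dy); a real predictor would output
    a different shift per pixel.
    """
    flow = np.zeros((height, width, 2), dtype=np.float32)
    flow[..., 0] = dx
    flow[..., 1] = dy
    return flow

def move_points(points_xy, flow):
    """Shift integer (x, y) pixel coordinates by the flow stored at each pixel."""
    pts = np.asarray(points_xy, dtype=np.int64)
    xs, ys = pts[:, 0], pts[:, 1]
    # Arrays are indexed [row, column], i.e. [y, x].
    return pts + flow[ys, xs]

# "Move left to right": every pixel shifts 2 pixels in +x and 0 in y.
flow = make_uniform_flow(height=4, width=6, dx=2.0, dy=0.0)
moved = move_points([(0, 1), (3, 2)], flow)
print(flow.shape)
print(moved)
```

The point of the sketch is the data layout: the motion of a whole scene fits in one array with two numbers per pixel, which is far more compact to describe than redrawing the future image itself.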