Robots learn better when they predict short, meaningful summaries of future images instead of drawing every pixel of the future scene.
Big idea: use a small, already-trained model to help a bigger model learn good habits early, so the big one trains faster and ends up smarter.