NE-Dreamer is a model-based reinforcement learning agent that skips rebuilding pixels and instead learns by predicting the next step’s hidden features.
The paper introduces CHAIN, a hands-on 3D playground that tests if AI can not only see objects but also plan and act under real physics.
The paper shows that AI agents can learn to cooperate simply by playing lots of different kinds of opponents and figuring them out on the fly, without hardcoding how those opponents learn.
WorldCompass teaches video world models to follow actions better and keep pictures pretty by using reinforcement learning after pretraining.
MIND is a new benchmark that fairly tests two core skills of world models: remembering the world over time (memory consistency) and following controls exactly (action control).
World models are AI tools that imagine the future so a robot can plan what to do next, but they are expensive to run many times in a row.
The paper makes long video generation much faster and lighter on memory by cutting out repeated work in attention.
This paper argues that true world models are not just sprinkling facts into single tasks, but building a unified system that can see, think, remember, act, and generate across many situations.
This paper shows how a video generator can improve its own videos during sampling, without extra training or outside checkers.
This paper says modern video generators are starting to act like tiny "world simulators," not just pretty video painters.
Agents often act like tourists without a map: they react to what they see now and miss long-term consequences.
Video models can now be told what physical result you want (like “make this ball move left with a strong push”) using Goal Force, instead of just vague text or a final picture.