The paper introduces CHAIN, a hands-on 3D playground that tests if AI can not only see objects but also plan and act under real physics.
WorldCompass teaches video world models to follow actions better and keep pictures pretty by using reinforcement learning after pretraining.
Video models can now be told what physical result you want (like โmake this ball move left with a strong pushโ) using Goal Force, instead of just vague text or a final picture.