This paper teaches text-to-video models to follow real-world physics, so people, balls, water, glass, and fire act the way they should.
RePlan is a plan-then-execute system that first figures out exactly where to edit in a picture and then makes clean changes there.
UniUGP is a single system that learns to understand road scenes, explain its thinking, plan safe paths, and even imagine future video frames.
ARBITRAGE makes AI solve step-by-step problems faster by only using the big, slow model when it is predicted to truly help.