This paper teaches video-making AI models to say how sure they are about each tiny part of every frame they create.
SCAIL is a new AI system that turns a single character image into a studio-quality animation by following the moves in a driving video.
Reinforcement learning (RL) can make big language models smarter, but off-policy training often pushes updates too far from the “safe zone,” causing unstable learning.
ProPhy is a new two-step method that helps video AIs follow real-world physics, not just make pretty pictures.
BEAVER is a new way to check, with guaranteed certainty, how likely a language model is to give answers that obey important rules.
SpaceControl lets you steer a powerful 3D generator with simple shapes you draw, without retraining the model.
This paper builds TAD, a brand-new test that checks if AI can understand what happens over time in real driving videos.
This paper teaches a computer to turn one single picture into a moving 3D scene that stays consistent from every camera angle.
ARBITRAGE makes AI solve step-by-step problems faster by only using the big, slow model when it is predicted to truly help.
EMMA is a single AI model that can understand images, write about them, create new images from text, and edit images—all in one unified system.
Large language models forget or misuse new facts if you only poke their weights once; EtCon fixes this with a two-step plan.
TwinFlow is a new way to make big image models draw great pictures in just one step instead of 40–100 steps.