This paper shows how a video generator can improve its own videos during sampling, without extra training or outside checkers.
SkyReels-V3 is a single AI model that can make videos in three ways: from reference images, by extending an existing video, and by creating talking avatars from audio.
This paper says modern video generators are starting to act like tiny "world simulators," not just pretty video painters.
This paper helps video-making AIs follow real-world physics better without retraining them.
DreamID-V is a new AI method that swaps faces in videos while keeping the body movements, expressions, lighting, and background steady and natural.
SVBench is the first benchmark that checks whether video generation models can show realistic social behavior, not just pretty pictures.
DreaMontage is a new AI method that makes long, single-shot videos that feel smooth and connected, even when it is given scattered images or short clips that belong in the middle of the video.
Kling-Omni is a single, unified model that can understand text, images, and videos together and then make or edit high-quality videos from those mixed instructions.
Spatia is a video generator that keeps a live 3D map of the scene (a point cloud) as its memory while making videos.
UniUGP is a single system that learns to understand road scenes, explain its thinking, plan safe paths, and even imagine future video frames.