OmniTransfer is a single system that learns from a whole reference video, not just one image, so it can copy how things look (identity and style) and how they move (motion, camera, effects).
3AM is a new way to track and segment the same object across a whole video, even when the camera view changes a lot.
PlenopticDreamer is a new way to remake a video from different camera paths while keeping everything consistent across views and over time.
This paper asks a simple question: do video AI models trained only on 2D videos secretly learn about 3D worlds?
MatSpray turns 2D guesses about what materials look like (color, shininess, metal) into a clean 3D model you can relight realistically.
InsertAnywhere is a two-stage system that lets you add a new object into any video so it looks like it was always there.
Digital humans used to just copy motions; this paper makes them think, speak, and move in sync like real people.