This paper speeds up diffusion transformers, a kind of image and video generator, by changing the size of their puzzle pieces (patches) at each generation step.
WorldWarp is a new method that turns a single photo plus a planned camera path into a long, steady, 3D-consistent video.
Normalizing flows are models that learn a reversible way to turn real images into simple noise, so the same model can be run backwards to turn noise into images.
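
To make the "into noise and back again" idea concrete, here is a minimal sketch of the simplest possible flow: a single shift-and-scale step applied to one number instead of an image. The class name `AffineFlow` and its parameters `shift` and `scale` are made up for illustration and are not taken from the paper. Real normalizing flows stack many learned, reversible layers, but the round trip works the same way.

```python
import math


class AffineFlow:
    """Toy 1-D flow (hypothetical, for illustration): z = (x - shift) / scale."""

    def __init__(self, shift, scale):
        if scale <= 0:
            raise ValueError("scale must be positive so the map is invertible")
        self.shift = shift
        self.scale = scale

    def forward(self, x):
        # Data -> noise direction.
        return (x - self.shift) / self.scale

    def inverse(self, z):
        # Noise -> data direction; exactly undoes forward().
        return z * self.scale + self.shift

    def log_prob(self, x):
        # Change of variables: log p(x) = log N(z; 0, 1) + log |dz/dx|.
        z = self.forward(x)
        log_base = -0.5 * (z * z + math.log(2.0 * math.pi))
        log_det = -math.log(self.scale)  # dz/dx = 1 / scale
        return log_base + log_det


flow = AffineFlow(shift=3.0, scale=2.0)
z = flow.forward(5.0)      # map a data point to the noise space
x_back = flow.inverse(z)   # map it back to the data space
```

Because every step can be undone exactly, the model can also report how likely a given data point is (`log_prob`), which is what such a model would be trained to maximize.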