Video generators are slow because attention looks at everything, which takes a lot of time.
Diffusion models make great images and videos but are slow because they usually need many tiny steps.
LongVie 2 is a video world model that can generate controllable videos for 3โ5 minutes while keeping the look and motion steady over time.