LIVE is a new way to train video-making AIs so their mistakes don’t snowball over long videos.
The paper fixes a hidden mistake many fast video generators were making when turning a "see-everything" model into a "see-past-only" model.
The paper makes long video generation much faster and lighter on memory by cutting out repeated work in attention.
This paper finds that about 1 out of every 4 attention heads in autoregressive video diffusion models mostly looks only at the current frame and almost ignores the past, wasting memory and time.
This paper introduces Knot Forcing, a way to make talking-head videos that look great while being generated live, frame by frame.
HiStream makes 1080p video generation much faster by removing repeated work across space, time, and steps.
This paper fixes a common problem in video-making AIs where tiny mistakes snowball over time and ruin long videos.