This paper finds that about 1 out of every 4 attention heads in autoregressive video diffusion models mostly looks only at the current frame and almost ignores the past, wasting memory and time.
HiStream makes 1080p video generation much faster by removing repeated work across space, time, and steps.