Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention
IntermediateDvir Samuel, Issar Tzachor et al.Feb 2arXiv
The paper makes long video generation much faster and lighter on memory by cutting out repeated work in attention.
#autoregressive video diffusion#KV cache compression#sparse attention