Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
IntermediateHaocheng Xi, Shuo Yang et al.Feb 3arXiv
Auto-regressive video models make videos one chunk at a time but run out of GPU memory because the KV-cache grows with history.
#Quant VideoGen (QVG)#KV-cache quantization#2-bit quantization