LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
BeginnerEthan Chern, Zhulin Hu et al.Dec 29arXiv
LiveTalk turns slow, many-step video diffusion into a fast, 4-step, real-time system for talking avatars that listen, think, and respond with synchronized video.
#real-time video diffusion#on-policy distillation#multimodal conditioning