RIVER Bench is a new test that checks how well AI can watch a video stream and talk with you in real time.
Short videos are easy for AI to make sharp and lively, but long videos need stories and memory, and there isn’t much training data for that.
FastVMT is a faster way to copy motion from one video to another without training a new model for each video.
LIVE is a new way to train video-making AIs so their mistakes don’t snowball over long videos.
This paper finds that about 1 out of every 4 attention heads in autoregressive video diffusion models mostly looks only at the current frame and almost ignores the past, wasting memory and time.
FlashPortrait makes talking-portrait videos that keep a person’s identity steady for as long as you want—minutes or even hours.