KlingAvatar 2.0 is a system that makes long, sharp, lifelike talking-person videos that follow audio, images, and text instructions all at once.
The paper teaches a video generator to move things realistically by borrowing motion knowledge from a strong video tracker.
SCAIL is a new AI system that turns a single character image into a studio-quality animation by following the moves in a driving video.