DreamActor-M2 is a new way to make a still picture move by copying motion from a video while keeping the character’s look the same.
CoDance is a new way to animate many characters in one picture using just one pose video, even if the picture and the video do not line up perfectly.
Alterbute is a diffusion-based method that changes an object's intrinsic attributes (color, texture, material, shape) in a photo while keeping the object's identity and the scene intact.
MoCha is a new AI method that replaces a person in a video with a new character using only one mask on one frame and a few reference photos.
VINO is a single AI model that can make and edit both images and videos by reading text and looking at reference pictures and clips at the same time.
LiveTalk turns slow, many-step video diffusion into a fast, 4-step, real-time system for talking avatars that listen, think, and respond with synchronized video.
This paper introduces Knot Forcing, a way to make talking-head videos that look great while being generated live, frame by frame.
WorldCanvas lets you make videos where things happen exactly how you ask by combining three inputs: text (what happens), drawn paths called trajectories (when and where it happens), and reference images (who it is).
KlingAvatar 2.0 is a system that makes long, sharp, lifelike talking-person videos that follow audio, images, and text instructions all at once.
Scone is a new AI method that makes images from instructions while correctly picking the right subject even when several candidate subjects look similar.