FlashPortrait makes talking-portrait videos that keep a person’s identity steady for as long as you want—minutes or even hours.
Saber is a new way to make videos that match a text description while keeping the look of people or objects from reference photos, without needing special triplet datasets.
This paper teaches image models to keep things consistent across multiple pictures—like the same character, art style, and story logic—using reinforcement learning (RL).