UniUGP is a single system that learns to understand road scenes, explain its thinking, plan safe paths, and even imagine future video frames.
Role-playing agents need to juggle several goals at once, like staying in character, following instructions, and using the right tone.
The paper shows that many AI image generators are trained to prefer one popular idea of beauty, even when a user clearly asks for something messy, dark, blurry, or emotionally heavy.
VideoCoF is a new video-editing method that first figures out WHERE to edit and then makes the change, like thinking before acting.