DREAM is a single model that both understands images (like CLIP) and generates images from text (like leading text-to-image models).
VINO is a single AI model that can create and edit both images and videos, guided by text together with reference pictures and clips at the same time.