EMMA is a single AI model that can understand images, write about them, create new images from text, and edit imagesβall in one unified system.
TwinFlow is a new way to make big image models draw great pictures in just one step instead of 40β100 steps.
Before this work, big vision-language models (VLMs) were great at understanding pictures and words together but not at making new pictures.