DeepGen 1.0 is a small 5B-parameter model that can both make new images and smartly edit existing ones from text instructions.
MetaCanvas lets a multimodal language model (MLLM) sketch a plan inside the generatorโs hidden canvas so diffusion models can follow it patch by patch.