Exploring MLLM-Diffusion Information Transfer with MetaCanvas
IntermediateHan Lin, Xichen Pan et al.Dec 12arXiv
MetaCanvas lets a multimodal language model (MLLM) sketch a plan inside the generatorโs hidden canvas so diffusion models can follow it patch by patch.
#MetaCanvas#MLLM#Diffusion Transformer