DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
Intermediate · Lunbin Zeng, Jingfeng Yao et al. · Dec 17 · arXiv
This paper shows a simple way to turn any strong autoregressive (step-by-step) model into a diffusion vision-language model (parallel, block-by-block) without changing the architecture.
#DiffusionVL #diffusion vision-language model #block diffusion