DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
BeginnerZefeng He, Xiaoye Qu et al.Dec 30arXiv
DiffThinker turns hard picture-based puzzles into an image-to-image drawing task instead of a long texting task.
#DiffThinker#Generative Multimodal Reasoning#Diffusion Models