Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models
IntermediateSen Ye, Mengde Xu et al.Feb 17arXiv
Big idea: Make image-making AIs stop, think, check, and fix their own work so they get better at both creating pictures and understanding them.
#multimodal models#image generation#reasoning