Quantifying the Gap between Understanding and Generation within Unified Multimodal Models
IntermediateChenlong Wang, Yuhang Chen et al.Feb 2arXiv
This paper shows that many AI models that both read images and write images are not truly unified insideโthey often understand well but fail to generate (or the other way around).
#Unified Multimodal Models#GAPEVAL#Gap Score