Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
Beginner · Zengbin Wang, Xuecai Hu et al. · Jan 28 · arXiv
Text-to-image models draw pretty pictures, but often put things in the wrong places or miss how objects interact.
#text-to-image #spatial-intelligence #occlusion