WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
Intermediate | Runjie Zhou, Youbo Shao et al. | Jan 28 | arXiv
WorldVQA is a new benchmark that checks whether multimodal AI models can correctly name what they see in images without doing extra reasoning.
#WorldVQA #atomic visual knowledge #multimodal large language models