BabyVision: Visual Reasoning Beyond Language
Intermediate | Liang Chen, Weichu Xie et al. | Jan 10 | arXiv
BabyVision is a new test of whether AI models can solve the same basic picture puzzles that young children can, without leaning on language shortcuts.
#BabyVision #visual reasoning #multimodal large language models