SpatiaLab: Can Vision-Language Models Perform Spatial Reasoning in the Wild?
Intermediate · Azmine Toushik Wasi, Wahid Faisal et al. · Feb 3 · arXiv
SpatiaLab is a new test that checks whether vision-language models (VLMs) can understand real-world spatial puzzles, like what's in front, behind, bigger, or reachable.
#SpatiaLab #spatial reasoning #vision-language models