SpatiaLab is a new benchmark that tests whether vision-language models (VLMs) can solve real-world spatial puzzles, such as working out what's in front, behind, bigger, or reachable.
The paper shows a simple way to teach AI models what not to learn: during pretraining, it removes only the specific tokens (words or word pieces) related to unwanted topics.
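
To make the token-removal idea concrete, here is a minimal sketch in Python. It is not the paper's implementation. The summary does not say how unwanted tokens are identified, or whether they are deleted from the text or only excluded from the training loss. The names `filter_tokens`, `in_blocklist`, and `BLOCKLIST` are hypothetical, and a simple word blocklist stands in for whatever detector the paper actually uses.

```python
from typing import Callable, List, Tuple

# Hypothetical blocklist standing in for an "unwanted topic" detector.
# The detection method used in the paper is not described in this summary.
BLOCKLIST = {"toxin", "pathogen"}


def in_blocklist(token: str) -> bool:
    """Return True if the token is flagged as unwanted (case-insensitive)."""
    return token.lower() in BLOCKLIST


def filter_tokens(
    tokens: List[str], is_unwanted: Callable[[str], bool]
) -> Tuple[List[str], List[bool]]:
    """Drop only the flagged tokens and keep everything else.

    Returns the surviving tokens plus a keep-mask aligned with the input.
    The mask could instead be used to exclude flagged positions from a
    training loss; which variant the paper uses is an open assumption here.
    """
    keep_mask = [not is_unwanted(t) for t in tokens]
    kept = [t for t, keep in zip(tokens, keep_mask) if keep]
    return kept, keep_mask


# Whitespace splitting is a stand-in for a real subword tokenizer.
tokens = "The pathogen spreads while the weather stays mild".split()
kept, mask = filter_tokens(tokens, in_blocklist)
print(kept)
print(mask)
```

Only the single flagged token is dropped here, and the rest of the sentence stays available for training. That is the contrast with discarding a whole document because of one unwanted term.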