WorldVQA is a new test that checks if multimodal AI models can correctly name what they see in pictures without doing extra reasoning.
Long AI tasks can go wrong early and keep getting worse, like a snowball of mistakes called the Spiral of Hallucination.