Multimodal AI models can mix up what they see and what they hear, making things up across senses; this is called cross-modal hallucination.
The FACTS Leaderboard is a four-part test that checks how truthful AI models are across images, memory, web search, and document grounding.