Multimodal AI models can mix up what they see and what they hear, making things up across senses; this is called cross-modal hallucination.
This paper introduces PCED, a way to use many documents as separate 'experts' in parallel so an AI can stitch answers together without stuffing everything into one giant prompt.