MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models
IntermediateSangyun Chung, Se Yeon Kim et al.Jan 29arXiv
Multimodal AI models can mix up what they see and what they hear, making things up across senses; this is called cross-modal hallucination.
#multimodal large language models#cross-modal hallucination#contrastive decoding