FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs
IntermediateQian Chen, Jinlan Fu et al.Jan 20arXiv
FutureOmni is the first benchmark that tests if multimodal AI models can predict what happens next from both sound and video, not just explain what already happened.
#multimodal LLM#audio-visual reasoning#future forecasting