SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature
IntermediateYiming Ren, Junjie Wang et al.Jan 15arXiv
The paper introduces SIN-Bench, a new way to test AI that read long scientific papers by forcing them to show exactly where their answers come from.
#multimodal large language models#long-context reasoning#evidence chains