Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities
IntermediateShuangshuang Ying, Zheyu Wang et al.Jan 29arXiv
This paper builds a safe science βplaygroundβ called DeR that fairly tests how AI finds facts (retrieval) and how it thinks with those facts (reasoning) without mixing them up.
#retrieval-augmented generation#document-grounded reasoning#deep research benchmark