SAGE: Benchmarking and Improving Retrieval for Deep Research Agents
IntermediateTiansheng Hu, Yilun Zhao et al.Feb 5arXiv
SAGE is a new test for how well AI research agents find scientific papers when questions require multi-step reasoning.
#SAGE benchmark#scientific literature retrieval#deep research agents