Papers2

#RAG evaluation

Legal RAG Bench: an end-to-end benchmark for legal RAG

Abdur-Rahman Butler, Umar ButlerMar 2arXiv

Legal RAG Bench is a new, end-to-end test that checks how well legal AI systems find information and use it to answer tough, real-world legal questions.

#legal RAG#retrieval-augmented generation#embedding models

Not triaged yet

Over-Searching in Search-Augmented Large Language Models

Intermediate

Roy Xie, Deepak Gopinath et al.Jan 9arXiv

The paper shows that language models with a search tool often look up too much information, which wastes compute and can make answers worse on unanswerable questions.

#search-augmented LLMs#over-searching#abstention

Not triaged yet