Papers17

All Beginner Intermediate Advanced

All Sources arXiv

#Retrieval-Augmented Generation

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Intermediate

Mingxuan Du, Benfeng Xu et al.Feb 3arXiv

A-RAG lets the AI choose how to search, what to read, and when to stop, instead of following a fixed recipe.

#Agentic RAG#Hierarchical Retrieval Interfaces#Keyword Search

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Beginner

Pengyu Wang, Benfeng Xu et al.Feb 2arXiv

WildGraphBench is a new test that checks how well GraphRAG systems find and combine facts from messy, real-world web pages.

#GraphRAG#Retrieval-Augmented Generation#Wikipedia references

AACR-Bench: Evaluating Automatic Code Review with Holistic Repository-Level Context

Intermediate

Lei Zhang, Yongda Yu et al.Jan 27arXiv

AACR-Bench is a new test set that checks how well AI can do code reviews using the whole project, not just one file.

#Automated Code Review#Benchmark#Repository-level Context

Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain

Intermediate

Özgür Uğur, Mahmut Göksu et al.Jan 22arXiv

The paper builds special Turkish legal AI models called Mecellem by teaching them from the ground up and then giving them more law-focused lessons.

#Turkish legal NLP#ModernBERT#Continual pre-training

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Intermediate

Wei Zhou, Jun Zhou et al.Jan 22arXiv

This survey explains how large language models (LLMs) can clean, connect, and enrich messy data so it’s ready for real apps like dashboards, fraud detection, and training AI.

#Data Preparation#Data Cleaning#Data Integration

Agentic Reasoning for Large Language Models

Intermediate

Tianxin Wei, Ting-Wei Li et al.Jan 18arXiv

This paper explains how to turn large language models (LLMs) from quiet students that only answer questions into active agents that can plan, act, and learn over time.

#Agentic Reasoning#LLM Agents#In-Context Learning

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Intermediate

Peizhou Huang, Zixuan Zhong et al.Jan 18arXiv

This paper introduces MMDeepResearch-Bench (MMDR-Bench), a new test that checks how well AI “deep research agents” write long, citation-rich reports using both text and images.

#Multimodal Deep Research#Benchmark#Citation Grounding

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Intermediate

Jiayu Liu, Rui Wang et al.Jan 16arXiv

The paper studies why large language models (LLMs) sound too sure of themselves when using retrieval-augmented generation (RAG) and how to fix it.

#Retrieval-Augmented Generation#Confidence Calibration#Expected Calibration Error

OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG

Intermediate

Fengran Mo, Zhan Su et al.Jan 13arXiv

OpenDecoder teaches large language models (LLMs) to pay more attention to better documents during Retrieval-Augmented Generation (RAG).

#Retrieval-Augmented Generation#LLM Decoding#Attention Modulation

Parallel Context-of-Experts Decoding for Retrieval Augmented Generation

Intermediate

Giulio Corallo, Paolo PapottiJan 13arXiv

This paper introduces PCED, a way to use many documents as separate 'experts' in parallel so an AI can stitch answers together without stuffing everything into one giant prompt.

#Retrieval-Augmented Generation#PCED#contrastive decoding

ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios

Intermediate

António Loison, Quentin Macé et al.Jan 13arXiv

ViDoRe V3 is a big, carefully built test that checks how well AI systems find and use information from both text and pictures (like tables and charts) in real documents.

#Retrieval-Augmented Generation#Multimodal RAG#Visual Document Understanding

DocDancer: Towards Agentic Document-Grounded Information Seeking

Intermediate

Qintong Zhang, Xinjie Lv et al.Jan 8arXiv

DocDancer is a smart document helper that answers questions by exploring and reading long, mixed-media PDFs using just two tools: Search and Read.

#Document Question Answering#Agentic Information Seeking#ReAct

1 2