Papers19

All Beginner Intermediate Advanced

All Sources arXiv

#Retrieval-Augmented Generation

Benchmarking Large Language Models for Knowledge Graph Validation

Beginner

Farzad Shami, Stefano Marchesin et al.Feb 11arXiv

Knowledge graphs are like giant fact maps, and keeping every fact correct is hard and important.

#Knowledge Graph Validation#Fact Checking#Large Language Models

Not triaged yet

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Intermediate

Mingxuan Du, Benfeng Xu et al.Feb 3arXiv

A-RAG lets the AI choose how to search, what to read, and when to stop, instead of following a fixed recipe.

#Agentic RAG#Hierarchical Retrieval Interfaces#Keyword Search

Not triaged yet

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Beginner

Pengyu Wang, Benfeng Xu et al.Feb 2arXiv

WildGraphBench is a new test that checks how well GraphRAG systems find and combine facts from messy, real-world web pages.

#GraphRAG#Retrieval-Augmented Generation#Wikipedia references

Not triaged yet

Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation

Intermediate

Kwun Hang Lau, Fangyuan Zhang et al.Feb 2arXiv

CatRAG is a new way for AI to find the right facts by letting the knowledge graph change its paths based on each question.

#Retrieval-Augmented Generation#Knowledge Graph#Personalized PageRank

Not triaged yet

AACR-Bench: Evaluating Automatic Code Review with Holistic Repository-Level Context

Intermediate

Lei Zhang, Yongda Yu et al.Jan 27arXiv

AACR-Bench is a new test set that checks how well AI can do code reviews using the whole project, not just one file.

#Automated Code Review#Benchmark#Repository-level Context

Not triaged yet

Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain

Intermediate

Özgür Uğur, Mahmut Göksu et al.Jan 22arXiv

The paper builds special Turkish legal AI models called Mecellem by teaching them from the ground up and then giving them more law-focused lessons.

#Turkish legal NLP#ModernBERT#Continual pre-training

Not triaged yet

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Intermediate

Wei Zhou, Jun Zhou et al.Jan 22arXiv

This survey explains how large language models (LLMs) can clean, connect, and enrich messy data so it’s ready for real apps like dashboards, fraud detection, and training AI.

#Data Preparation#Data Cleaning#Data Integration

Not triaged yet

Agentic Reasoning for Large Language Models

Intermediate

Tianxin Wei, Ting-Wei Li et al.Jan 18arXiv

This paper explains how to turn large language models (LLMs) from quiet students that only answer questions into active agents that can plan, act, and learn over time.

#Agentic Reasoning#LLM Agents#In-Context Learning

Not triaged yet

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Intermediate

Peizhou Huang, Zixuan Zhong et al.Jan 18arXiv

This paper introduces MMDeepResearch-Bench (MMDR-Bench), a new test that checks how well AI “deep research agents” write long, citation-rich reports using both text and images.

#Multimodal Deep Research#Benchmark#Citation Grounding

Not triaged yet

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Intermediate

Jiayu Liu, Rui Wang et al.Jan 16arXiv

The paper studies why large language models (LLMs) sound too sure of themselves when using retrieval-augmented generation (RAG) and how to fix it.

#Retrieval-Augmented Generation#Confidence Calibration#Expected Calibration Error

Not triaged yet

OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG

Intermediate

Fengran Mo, Zhan Su et al.Jan 13arXiv

OpenDecoder teaches large language models (LLMs) to pay more attention to better documents during Retrieval-Augmented Generation (RAG).

#Retrieval-Augmented Generation#LLM Decoding#Attention Modulation

Not triaged yet

Parallel Context-of-Experts Decoding for Retrieval Augmented Generation

Intermediate

Giulio Corallo, Paolo PapottiJan 13arXiv

This paper introduces PCED, a way to use many documents as separate 'experts' in parallel so an AI can stitch answers together without stuffing everything into one giant prompt.

#Retrieval-Augmented Generation#PCED#contrastive decoding

Not triaged yet

1 2