Papers27

#retrieval-augmented generation

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

DARE is a new way for AI assistants to find the right R functions by also looking at what the data looks like, not just the words in the question.

#distribution-aware retrieval#RPKB#RCodingAgent

Not triaged yet

Panini: Continual Learning in Token Space via Structured Memory

Intermediate

Shreyas Rajesh, Pavan Holur et al.Feb 16arXiv

Panini is a way for AI to keep learning new facts without changing its brain by storing them as tiny linked Q&A facts in an external memory.

#non-parametric continual learning#structured memory#Generative Semantic Workspace

Not triaged yet

AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research

Intermediate

Yishan Li, Wentong Chen et al.Feb 6arXiv

This paper teaches small, local AI models to write deep, insightful research reports by letting writing and planning work together instead of staying separate.

#AgentCPM-Report#WARP#Writing As Reasoning Policy

Not triaged yet

Semantic Search over 9 Million Mathematical Theorems

Intermediate

Luke Alexander, Eric Leonen et al.Feb 5arXiv

This paper builds a Google-for-theorems: a semantic search engine that finds exact theorems, lemmas, and propositions instead of just entire papers.

#semantic theorem search#mathematical information retrieval#dense retrieval

Not triaged yet

Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Intermediate

Bowei He, Minda Hu et al.Feb 3arXiv

This paper teaches AI to look things up on the web and fix its own mistakes mid-thought instead of starting over from scratch.

#search-integrated reasoning#reinforcement learning#credit assignment

Not triaged yet

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Intermediate

Haozhen Zhang, Quanyu Long et al.Feb 2arXiv

MemSkill turns memory operations for AI agents into learnable skills instead of fixed, hand-made rules.

#memory skills#LLM agents#skill bank

Not triaged yet

Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation

Intermediate

Jun He, Junyan Ye et al.Feb 2arXiv

Mind-Brush turns image generation from a one-step 'read the prompt and draw' into a multi-step 'think, research, and create' process.

#agentic image generation#multimodal reasoning#retrieval-augmented generation

Not triaged yet

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Intermediate

Shuangshuang Ying, Zheyu Wang et al.Jan 29arXiv

This paper builds a safe science “playground” called DeR that fairly tests how AI finds facts (retrieval) and how it thinks with those facts (reasoning) without mixing them up.

#retrieval-augmented generation#document-grounded reasoning#deep research benchmark

Not triaged yet

AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts

Intermediate

Shicheng Fang, Yuxin Wang et al.Jan 28arXiv

AgentLongBench is a new test that checks how well AI agents think over very long stories made of their own actions and the world's replies, not just by reading static documents.

#AgentLongBench#long-context agents#environment rollouts

Not triaged yet

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Intermediate

James Burgess, Jan N. Hansen et al.Jan 26arXiv

This paper teaches a language-model agent to look up facts in millions of scientific paper summaries and answer clear, single-answer questions.

#RLVR#search agents#PaperSearchQA

Not triaged yet

SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback

Intermediate

Fangyuan Xu, Rujun Han et al.Jan 26arXiv

SAGE is a two-agent system that automatically writes tough, multi-step search questions and checks them by actually trying to solve them.

#deep search#agentic data generation#execution feedback

Not triaged yet

DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal

Intermediate

Peixuan Han, Yingjie Yu et al.Jan 26arXiv

DRPG is a four-step AI helper that writes strong academic rebuttals by first breaking a review into parts, then fetching evidence, planning a strategy, and finally writing the response.

#academic rebuttal#agentic framework#planning with LLMs

Not triaged yet

1 2 3