Papers7

#agentic reasoning

Experiential Reinforcement Learning

Taiwei Shi, Sihao Chen et al.Feb 15arXiv

This paper teaches AI models to learn like good students: try, think about what went wrong, fix it, and remember the fix.

#Experiential Reinforcement Learning#self-reflection#distillation

Not triaged yet

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Intermediate

Dongyang Chen, Chaoyang Wang et al.Feb 5arXiv

V-Retrver is a new way for AI to search across text and images by double-checking tiny visual details instead of only guessing from words.

#V-Retrver#multimodal retrieval#agentic reasoning

Not triaged yet

Beyond Pixels: Visual Metaphor Transfer via Schema-Driven Agentic Reasoning

Intermediate

Yu Xu, Yuxin Zhang et al.Feb 1arXiv

This paper teaches AI to copy the hidden idea inside a picture (a visual metaphor) and reuse that idea on a brand‑new subject.

#visual metaphor#metaphor transfer#schema grammar

Not triaged yet

Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Intermediate

Haotian Li, Shijun Yang et al.Jan 26arXiv

This paper builds an AI agent that learns new skills while working, like a kid who learns new tricks during recess without a teacher telling them what to do.

#in-situ self-evolution#tool evolution#parallel batch evolution

Not triaged yet

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

Intermediate

Junqi Liu, Zihao Zhou et al.Jan 20arXiv

Numina-Lean-Agent is a new open system that uses a general coding agent to write and check exact math proofs in Lean without special training.

#formal theorem proving#Lean#agentic reasoning

Not triaged yet

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Intermediate

Jungho Cho, Minbyul Jeong et al.Jan 13arXiv

The paper builds a new way to create realistic, long conversations between people and AI that use tools like databases.

#multi-turn dialogue generation#tool use#user simulation

Not triaged yet

SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Intermediate

Yong Xien Chng, Tao Hu et al.Dec 30arXiv

SenseNova-MARS is a vision-language model that can think step-by-step and use three tools—text search, image search, and image cropping—during its reasoning.

#multimodal agent#vision-language model#reinforcement learning

Not triaged yet