How I Study AI - Learn AI Papers & Lectures the Easy Way

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Intermediate

Dadi Guo, Yuejin Xie et al.Mar 3arXiv

This paper shows that code-writing AI agents can take an existing math problem and automatically turn it into a new, harder one while keeping it solvable.

#code agents#multi-agent systems#mathematical reasoning

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Intermediate

Guoxin Chen, Fanzhe Meng et al.Mar 3arXiv

BeyondSWE is a new benchmark that tests code agents on tougher, more real-life tasks than single-repo bug fixing.

#BeyondSWE#code agents#software engineering benchmark

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Intermediate

Junru Lu, Jiarui Qin et al.Dec 31arXiv

Youtu-LLM is a small (1.96B) language model that was trained from scratch to think, plan, and act like an agent instead of just copying bigger models.

#lightweight LLM#agentic mid-training#trajectory data

Papers3

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models