How I Study AI - Learn AI Papers & Lectures the Easy Way

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Intermediate

Dadi Guo, Yuejin Xie et al. · Mar 3 · arXiv

This paper shows that code-writing AI agents can take an existing math problem and automatically turn it into a new, harder one while keeping it solvable.

#code agents #multi-agent systems #mathematical reasoning

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Intermediate

Chuxue Cao, Jinluan Yang et al. · Jan 30 · arXiv

Large language models sometimes reach the right answer for the wrong reasons, which is risky and confusing.

#formal logic verification #interleaved verification #neuro-symbolic reasoning

VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning

Intermediate

Vikash Singh, Darion Cassel et al. · Jan 27 · arXiv

VERGE is a system in which an AI writer (an LLM) works with a strict logic checker (an SMT solver) so that answers are both capable and logically sound.

#VERGE #neurosymbolic reasoning #SMT solver
