Papers2

#solvability verification

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

This paper shows that code-writing AI agents can take an existing math problem and automatically turn it into a new, harder one while keeping it solvable.

#code agents#multi-agent systems#mathematical reasoning

Not triaged yet

CoDiQ: Test-Time Scaling for Controllable Difficult Question Generation

Intermediate

Zhongyuan Peng, Caijun Xu et al.Feb 2arXiv

CoDiQ is a recipe for making hard-but-solvable math and coding questions on purpose, and it controls how hard they get while you generate them.

#controllable difficulty#test-time scaling#question generation

Not triaged yet