Papers2

#evolutionary search

Learning to Discover at Test Time

Mert Yuksekgonul, Daniel Koceja et al.Jan 22arXiv

This paper shows how to keep training a language model while it is solving one hard, real problem, so it can discover a single, truly great answer instead of many average ones.

#test-time training#reinforcement learning#entropic objective

Not triaged yet

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Intermediate

Minghao Yan, Bo Peng et al.Jan 15arXiv

PACEvolve is a new recipe that helps AI agents improve their ideas step by step over long periods without getting stuck.

#evolutionary search#LLM agents#context management

Not triaged yet