This paper teaches a language model to write fast GPU kernels (small programs that run on the GPU) in Triton, using reinforcement learning that rewards meaningful speedups rather than correctness alone.
PACEvolve is a new recipe that helps AI agents improve their ideas step by step over long periods without getting stuck.