This paper teaches a language model to improve its own math answers by first writing several drafts and then learning to beat its best draft.
This paper shows that giving an AI a safe, tiny virtual computer (a sandbox) lets it solve many kinds of problems better, not just coding ones.
Re-Align is a new way for AI to make and edit pictures by thinking in clear steps before drawing.
Falcon-H1R is a small (7B-parameter) AI model that reasons well without needing giant computers.