This paper shows that, when teaching a reasoning AI with step-by-step examples, repeating a small set of examples many times can beat using a huge set only once.
This paper shows a simple way for AI models to keep learning new things without forgetting what they already know.