The paper asks a simple question: Which step-by-step explanations from a teacher model actually help a student model learn to reason better?
The paper proposes the Laws of Reasoning (LORE), simple rules that say how much a model should think and how accurate it can be as problems get harder.
This paper teaches computers to understand words by also looking at the smaller pieces inside words, like 'un-', 'play', and '-ing'.
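As a rough illustration of the idea in the last summary, the sketch below splits a word into smaller pieces and builds the word's vector by averaging the vectors of those pieces. Everything here is an assumption made for illustration, not the paper's method: the tiny affix lists, the greedy splitting rule, the helper names `split_into_pieces` and `word_vector`, and the 3-dimensional toy vectors are all hypothetical.

```python
# Toy affix inventory (hypothetical); a real system would learn or look up word pieces.
PREFIXES = ("un", "re")
SUFFIXES = ("ing", "ed", "s")


def split_into_pieces(word):
    """Greedily split a word into at most one prefix, a stem, and at most one suffix."""
    pieces = []
    for prefix in PREFIXES:
        # Require a stem of at least 3 characters so short words like "red" stay whole.
        if word.startswith(prefix) and len(word) > len(prefix) + 2:
            pieces.append(prefix + "-")
            word = word[len(prefix):]
            break
    suffix_piece = None
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            suffix_piece = "-" + suffix
            word = word[:-len(suffix)]
            break
    pieces.append(word)
    if suffix_piece:
        pieces.append(suffix_piece)
    return pieces


def word_vector(word, piece_vectors, dim=3):
    """Average the vectors of a word's pieces; unknown pieces contribute zeros."""
    pieces = split_into_pieces(word)
    total = [0.0] * dim
    for piece in pieces:
        vec = piece_vectors.get(piece, [0.0] * dim)
        total = [t + v for t, v in zip(total, vec)]
    return [t / len(pieces) for t in total]


# Hypothetical toy vectors, one per word piece.
toy_vectors = {
    "un-": [3.0, 0.0, 0.0],
    "play": [0.0, 3.0, 0.0],
    "-ing": [0.0, 0.0, 3.0],
}

print(split_into_pieces("unplaying"))
print(word_vector("unplaying", toy_vectors))
```

Because the pieces are shared across words, a word the model has never seen can still get a sensible vector as long as its pieces are known, which is the intuition the summary points at.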