The paper asks a simple question: Which step-by-step explanations from a teacher model actually help a student model learn to reason better?
The paper proposes the Laws of Reasoning (LORE), simple rules that say how much a model should think and how accurate it can be as problems get harder.