Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment
IntermediateYuming Yang, Mingyoung Lai et al.Jan 20arXiv
The paper asks a simple question: Which step-by-step explanations from a teacher model actually help a student model learn to reason better?
#Rank-Surprisal Ratio#data-student suitability#chain-of-thought distillation