The paper asks a simple question: Which step-by-step explanations from a teacher model actually help a student model learn to reason better?
ToolPRMBench is a new benchmark that checks, step by step, whether an AI agent using tools picks the right next action.