InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
Intermediate · Matthew Y. R. Yang, Hao Bai et al. · Jan 20 · arXiv
The paper introduces Intervention Training (InT), a simple way for a language model to find and fix the first wrong step in its own reasoning using a short, targeted correction.
#Intervention Training #credit assignment #LLM reasoning