Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
IntermediateIan Wu, Yuxiao Qu et al.Feb 3arXiv
Reasoning Cache (RC) is a new way for AI to think in steps: it writes some thoughts, makes a short summary, throws away the long thoughts, and then keeps going using only the summary.
#Reasoning Cache#iterative decoding#summary-conditioned reasoning