When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
IntermediateLeheng Sheng, Yongtao Zhang et al.Feb 11arXiv
Long texts overwhelm many language models, which forget important bits and slow down as the context grows.
#gated recurrent memory#update gate#exit gate