Papers2

#test-time compute

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Reasoning Cache (RC) is a new way for AI to think in steps: it writes some thoughts, makes a short summary, throws away the long thoughts, and then keeps going using only the summary.

#Reasoning Cache#iterative decoding#summary-conditioned reasoning

Not triaged yet

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Intermediate

Yao Tang, Li Dong et al.Jan 13arXiv

The paper introduces Multiplex Thinking, a new way for AI to think by sampling several likely next words at once and blending them into a single super-token.

#Multiplex Thinking#chain-of-thought#continuous token

Not triaged yet