LLMs can think for many steps, but when they keep every step forever, the extra tokens turn into noise and make answers worse, not better.
Coding agents waste most of their tokens just reading giant files, which makes them slow and expensive.