Reasoning models often talk too much, and those extra words can actually make them more wrong.
This paper teaches a language model to think along several paths at the same time instead of one step after another.