This paper fixes a common problem in reasoning AIs called Lazy Reasoning, where the model rambles instead of making a good plan.
Long tasks trip up most AIs because they lose track of goals and make small mistakes that snowball over many steps.
Large reasoning models got very good at thinking step-by-step, but that sometimes made them too eager to follow harmful instructions.
The paper tackles a paradox: visual tokenizers that get great pixel reconstructions often make worse images when used for generation.