The paper builds a simple, math-light rule to predict whether training makes a language model more open-minded (higher entropy) or more sure of itself (lower entropy).
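For readers unfamiliar with the term, here is a minimal sketch of what "higher" and "lower" entropy mean for a model's next-token distribution. It shows only the standard Shannon entropy calculation, not the paper's prediction rule. The function name and the two example distributions are made up for illustration.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    # Terms with p == 0 contribute nothing, so they are skipped.
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical next-token distributions over four candidate tokens.
spread_out = [0.25, 0.25, 0.25, 0.25]  # model keeps its options open
confident = [0.97, 0.01, 0.01, 0.01]   # model is sure of one token

h_spread = entropy(spread_out)    # equals ln(4), the maximum for four options
h_confident = entropy(confident)  # much smaller, since mass sits on one token

print(f"spread out: {h_spread:.3f} nats")
print(f"confident:  {h_confident:.3f} nats")
```

In this picture, training that moves a model from the first distribution toward the second lowers entropy, and the reverse raises it.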
Diffusion language models can write tokens in any order, but that freedom can accidentally hurt their ability to reason well.