The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
BeginnerZanlin Ni, Shenzhi Wang et al.Jan 21arXiv
Diffusion language models can write tokens in any order, but that freedom can accidentally hurt their ability to reason well.
#diffusion language model#arbitrary order generation#autoregressive training