Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules
Beginner | Amr Mohamed, Yang Zhang et al. | Dec 2 | arXiv
Diffusion language models (dLLMs) can write all parts of an answer in parallel, but they usually need many small refinement steps to clean up that draft, which makes decoding slow.
#diffusion language models  #early exit decoding  #progress-aware threshold