The paper introduces a new way to sample text from masked diffusion language models that is smarter and less greedy.
Big models are often used to grade AI answers, but they are expensive, slow, and depend too much on tricky prompts.