The paper introduces a new way to sample text from masked diffusion language models that is smarter and less greedy.
BatCoder teaches a code model to write both code and its documentation by doing a round trip: from code to docs and back to code.
Stable-DiffCoder is a code-focused diffusion language model that learns to write and edit programs by filling in masked pieces, not just predicting the next token.
DEER is a new way to speed up big language models by letting a diffusion model draft many tokens at once and an autoregressive model double-check them.