Diffusion language models (dLLMs) generate several tokens at once but usually throw away lots of helpful clues at each step. RCD keeps and reuses those clues.
This paper introduces PCED, a way to use many documents as separate 'experts' in parallel so an AI can stitch answers together without stuffing everything into one giant prompt.
The paper asks what a truly good diffusion-based language model should look like and lists five must-have properties.
Dream-VL and Dream-VLA use a diffusion language model backbone to understand images, talk about them, and plan actions better than many regular (autoregressive) models.
Autoregressive (AR) models normally write one token at a time, which is accurate but slow for long answers.
ReFusion is a new way for AI to write text faster by planning in chunks (called slots) and then filling each chunk carefully.
Before this work, most big language models wrote one token at a time (autoregressively), which made generation slow and hard to parallelize.
Diffusion language models write by gradually unmasking hidden words, so deciding which blanks to reveal next is a big deal for both speed and accuracy.