OCR is like reading a page exactly as it is, and that strictness makes it perfect for fast, parallel generation.
Not triaged yet
LLaDA2.1 teaches a diffusion-style language model to write fast rough drafts and then fix its own mistakes by editing tokens it already wrote.
Diffusion language models can write tokens in any order, but that freedom can accidentally hurt their ability to reason well.