OCR means transcribing a page exactly as it is, and that strictness makes it a good fit for fast, parallel generation.
LLaDA2.1 teaches a diffusion-style language model to write fast rough drafts and then fix its own mistakes by editing tokens it already wrote.
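The draft-then-edit idea can be pictured with a toy sketch. This is not LLaDA2.1's actual algorithm or API: `draft_then_edit`, `toy_predict`, the `edit_threshold` value, and the rule that a token is predicted correctly only once its left neighbor is correct are all hypothetical, chosen only to show a parallel rough draft followed by confident in-place corrections.

```python
from typing import Callable, List, Tuple

MASK = "<mask>"
Prediction = Tuple[str, float]  # (token, confidence)

# Hypothetical target sentence used only by the toy predictor below.
TARGET = ["the", "cat", "sat", "down"]


def toy_predict(seq: List[str]) -> List[Prediction]:
    """Stand-in for a model: scores every position in one parallel pass.

    Assumption for illustration: a position is predicted correctly (with
    high confidence) only when its left neighbor is already correct;
    otherwise the predictor falls back to a low-confidence guess.
    """
    preds = []
    for i in range(len(seq)):
        if i == 0 or seq[i - 1] == TARGET[i - 1]:
            preds.append((TARGET[i], 0.95))
        else:
            preds.append(("the", 0.5))
    return preds


def draft_then_edit(
    length: int,
    predict: Callable[[List[str]], List[Prediction]],
    edit_threshold: float = 0.9,
    max_edit_rounds: int = 4,
) -> List[str]:
    # Draft: fill every position in a single parallel step,
    # accepting even low-confidence guesses.
    seq = [tok for tok, _ in predict([MASK] * length)]

    # Edit: re-score the full draft and overwrite already-written tokens
    # wherever the predictor now confidently disagrees with them.
    for _ in range(max_edit_rounds):
        preds = predict(seq)
        edits = {
            i: tok
            for i, (tok, conf) in enumerate(preds)
            if tok != seq[i] and conf >= edit_threshold
        }
        if not edits:
            break  # draft is stable; nothing left to fix
        for i, tok in edits.items():
            seq[i] = tok
    return seq


if __name__ == "__main__":
    print(draft_then_edit(4, toy_predict, max_edit_rounds=0))  # rough draft only
    print(draft_then_edit(4, toy_predict))  # draft plus edits
```

With `max_edit_rounds=0` the function returns the rough draft untouched; with the default setting each edit round repairs the tokens that the improved context now makes confident.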
Diffusion language models can write tokens in any order, but that freedom can accidentally hurt their ability to reason well.