LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
Beginner | Chenkai Xu, Yijie Jin et al. | Dec 18 | arXiv
This paper speeds up diffusion language models (dLLMs) by changing the order in which they fill in missing words.
#Diffusion LLM #Parallel decoding #Token Filling Order