Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Intermediate | Yonggan Fu, Lexington Whalen et al. | Dec 16 | arXiv
Autoregressive (AR) models write one word at a time, which is accurate but slow, especially when your computer or GPU can't keep many tasks in memory at once.
#diffusion language models #autoregressive models #AR-to-dLM conversion