DiRL: An Efficient Post-Training Framework for Diffusion Language Models
Intermediate · Ying Zhu, Jiaxin Wan et al. · Dec 23 · arXiv
This paper presents DiRL, an efficient post-training framework for diffusion language models that aims to improve their reasoning.
#Diffusion Language Model #Blockwise dLLM #Post-Training