Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models
Intermediate · Ziwei Luo, Ziqi Jin et al. · Feb 2 · arXiv
The paper introduces a self-rewarding sequential Monte Carlo method for sampling text from masked diffusion language models, aiming to be less greedy than conventional sampling.
#masked diffusion language models #sequential Monte Carlo #self-rewarding sampling