Scaling Behavior of Discrete Diffusion Language Models
IntermediateDimitri von RΓΌtte, Janis Fluri et al.Dec 11arXiv
This paper studies how a newer kind of language model, called a discrete diffusion language model (DLM), gets better as we give it more data, bigger models, and more compute.
#discrete diffusion#language models#scaling laws