Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models
BeginnerShufan Li, Jiuxiang Gu et al.Dec 16arXiv
Sparse-LaViDa makes diffusion-style AI models much faster by skipping unhelpful masked tokens during generation while keeping quality the same.
#Masked Discrete Diffusion#Sparse Parameterization#Register Tokens