DEER: Draft with Diffusion, Verify with Autoregressive Models
Intermediate · Zicong Cheng, Guo-Wei Yang et al. · Dec 17 · arXiv
DEER speeds up large language model inference with speculative decoding: a diffusion model drafts many tokens at once, and an autoregressive model verifies them.
#DEER #speculative decoding #diffusion LLM
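
To make the draft-then-verify idea concrete, here is a minimal sketch of a generic greedy speculative decoding loop. It is not the DEER implementation: the two toy functions only stand in for a block drafter (the role the diffusion model plays) and an autoregressive verifier, and every name in it (`verify_block`, `generate`, `toy_draft_block`, `toy_target_next`) is hypothetical.

```python
from typing import Callable, List, Tuple

Token = int


def verify_block(
    prefix: List[Token],
    draft: List[Token],
    target_next: Callable[[List[Token]], Token],
) -> List[Token]:
    """Keep the longest draft prefix the verifier agrees with.

    On the first mismatch the verifier's own token replaces the draft token,
    so every round yields at least one token. A real system would score all
    draft positions in one verifier forward pass; this sketch calls the
    verifier position by position for clarity.
    """
    accepted: List[Token] = []
    for tok in draft:
        expected = target_next(prefix + accepted)
        if tok != expected:
            accepted.append(expected)  # correction token from the verifier
            return accepted
        accepted.append(tok)
    # Whole draft accepted: the verifier contributes one extra token.
    accepted.append(target_next(prefix + accepted))
    return accepted


def generate(
    prompt: List[Token],
    draft_block: Callable[[List[Token], int], List[Token]],
    target_next: Callable[[List[Token]], Token],
    block_size: int,
    max_new_tokens: int,
) -> Tuple[List[Token], int]:
    """Run draft/verify rounds; return the sequence and the round count."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < max_new_tokens:
        draft = draft_block(out, block_size)  # many tokens proposed at once
        out.extend(verify_block(out, draft, target_next))
        rounds += 1
    return out[: len(prompt) + max_new_tokens], rounds


def toy_target_next(seq: List[Token]) -> Token:
    """Toy verifier (assumption): counts upward modulo 10."""
    return (seq[-1] + 1) % 10


def toy_draft_block(seq: List[Token], k: int) -> List[Token]:
    """Toy drafter (assumption): counts like the verifier but gets the
    last token of each block wrong, to exercise the rejection path."""
    block: List[Token] = []
    last = seq[-1]
    for i in range(k):
        step = 2 if i == k - 1 else 1
        last = (last + step) % 10
        block.append(last)
    return block


tokens, rounds = generate(
    [0], toy_draft_block, toy_target_next, block_size=4, max_new_tokens=8
)
print(tokens, rounds)
```

With greedy verification, the output is identical to what the verifier alone would produce token by token; the potential saving comes from needing fewer verification rounds than generated tokens whenever the drafter is mostly right.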