Fast and Accurate Causal Parallel Decoding using Jacobi Forcing
IntermediateLanxiang Hu, Siqi Kou et al.Dec 16arXiv
Autoregressive (AR) models normally write one token at a time, which is accurate but slow for long answers.
#Jacobi Forcing#Jacobi decoding#consistency distillation