VA-$π$: Variational Policy Alignment for Pixel-Aware Autoregressive Generation
Intermediate | Xinyao Liao, Qiyuan He et al. | Dec 22 | arXiv
Autoregressive (AR) image models generate pictures by choosing tokens one by one, but they are trained only to pick likely tokens, not to make the final picture look good in pixel space.
#autoregressive image generation #tokenizer–generator alignment #pixel-space reconstruction