How I Study AI - Learn AI Papers & Lectures the Easy Way

NativeTok: Native Visual Tokenization for Improved Image Generation

Intermediate

Bin Wu, Mengqi Huang et al.Jan 30arXiv

This paper fixes a hidden mismatch in image generation: tokenizers make tokens without order, but generators need an order to predict the next token well.

#visual tokenization#autoregressive image generation#causal dependencies

VA-$π$: Variational Policy Alignment for Pixel-Aware Autoregressive Generation

Intermediate

Xinyao Liao, Qiyuan He et al.Dec 22arXiv

Autoregressive (AR) image models make pictures by choosing tokens one-by-one, but they were judged only on picking likely tokens, not on how good the final picture looks in pixels.

#autoregressive image generation#tokenizer–generator alignment#pixel-space reconstruction

Spherical Leech Quantization for Visual Tokenization and Generation

Intermediate

Yue Zhao, Hanwen Jiang et al.Dec 16arXiv

This paper shows a simple, math-guided way to turn image pieces into tidy symbols (tokens) using points spread evenly on a sphere.

#Spherical Leech Quantization#Leech lattice#spherical codes

Papers3

NativeTok: Native Visual Tokenization for Improved Image Generation

VA-$π$: Variational Policy Alignment for Pixel-Aware Autoregressive Generation

Spherical Leech Quantization for Visual Tokenization and Generation