NativeTok: Native Visual Tokenization for Improved Image Generation
Intermediate · Bin Wu, Mengqi Huang et al. · Jan 30 · arXiv
This paper fixes a hidden mismatch in autoregressive image generation: tokenizers produce tokens with no inherent order, but generators need an order to predict the next token well.
#visual tokenization #autoregressive image generation #causal dependencies