Papers (5)

#autoregressive models

Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Harold Haodong Chen, Xinxiang Yin et al. · Feb 2 · arXiv

LatentMorph teaches an image-making AI to quietly think in its head while it draws, instead of stopping to write out its thoughts in words.

#latent reasoning #text-to-image generation #autoregressive models

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Intermediate

Bin Lin, Zongjian Li et al. · Jan 23 · arXiv

This paper fixes a hidden flaw in a popular image tokenizer (FSQ) with a simple one-line change to its activation function.

#image generation #finite scalar quantization #iFSQ

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Intermediate

Seijin Kobayashi, Yanick Schimpf et al. · Dec 23 · arXiv

The paper shows that big sequence models (like transformers) quietly learn longer-horizon goals inside their hidden activations, even though they are trained to predict only one step at a time.

#hierarchical reinforcement learning #temporal abstractions #autoregressive models

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Intermediate

Lanxiang Hu, Siqi Kou et al. · Dec 16 · arXiv

Autoregressive (AR) models normally write one token at a time, which is accurate but slow for long answers.

#Jacobi Forcing #Jacobi decoding #consistency distillation

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Intermediate

Yonggan Fu, Lexington Whalen et al. · Dec 16 · arXiv

Autoregressive (AR) models write one token at a time, which is accurate but slow, especially when your computer or GPU can’t keep many tasks in memory at once.

#diffusion language models #autoregressive models #AR-to-dLM conversion