LatentMorph teaches an image-making AI to quietly think in its head while it draws, instead of stopping to write out its thoughts in words.
This paper fixes a hidden flaw in a popular image tokenizer (FSQ) with a simple one-line change to its activation function.
The paper shows that big sequence models (like transformers) quietly learn longer goals inside their hidden activations, even though they are trained one step at a time.
Autoregressive (AR) models normally write one token at a time, which is accurate but slow for long answers.
Autoregressive (AR) models write one word at a time, which is accurate but slow, especially when your computer or GPU canβt keep many tasks in memory at once.