NE-Dreamer is a model-based reinforcement learning agent that skips rebuilding pixels and instead learns by predicting the next stepβs hidden features.
The paper shows that when we train with the popular InfoNCE contrastive loss, the learned features start to behave like they come from a Gaussian (bell-shaped) distribution.
This paper shows how to get strong text embeddings from decoder-only language models without any training.
This paper shows you can train a big text-to-image diffusion model directly on the features of a vision foundation model (like DINOv3) without using a VAE.