How I Study AI - Learn AI Papers & Lectures the Easy Way

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Arnas Uselis, Andrea Dittadi et al.Feb 27arXiv

The paper asks a simple question: what must a vision model’s internal pictures (embeddings) look like if it can recognize new mixes of things it already knows?

#compositional generalization#linear representation hypothesis#orthogonal representations

Not triaged yet

Sparse Reward Subsystem in Large Language Models

Intermediate

Guowei Xu, Mert Yuksekgonul et al.Feb 1arXiv

The paper discovers a tiny, special group of neurons inside large language models (LLMs) that act like a reward system in the human brain.

#value neurons#dopamine neurons#reward prediction error

Not triaged yet

Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

Intermediate

Mengqi He, Xinyu Tian et al.Dec 26arXiv

The paper shows that when vision-language models write captions, only a small set of uncertain words (about 20%) act like forks that steer the whole sentence.

#vision-language models#autoregressive generation#entropy

Not triaged yet

Papers3

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Sparse Reward Subsystem in Large Language Models

Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models