Concepts (11)

Sequence-to-Sequence with Attention

Sequence-to-sequence with attention lets a decoder focus on the most relevant parts of the input at each output step, rather than compressing everything into a single vector.

#sequence-to-sequence #attention #encoder-decoder

📚 Theory · Intermediate
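
To make the idea of "focusing" concrete, here is a minimal sketch of a single decoder step. It assumes dot-product scoring between the decoder state and each encoder state, which is only one of several possible scoring functions and is not specified by the summary above. The names `softmax`, `attend`, `encoder_states`, and `decoder_state` are hypothetical and chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_states):
    """Return (weights, context) for one decoder step.

    Assumption: relevance is scored with a plain dot product.
    """
    # One relevance score per input position.
    scores = [
        sum(d * h for d, h in zip(decoder_state, enc))
        for enc in encoder_states
    ]
    # Normalize the scores into weights that sum to 1.
    weights = softmax(scores)
    # The context vector is the weighted sum of all encoder states,
    # recomputed at every output step instead of fixed once.
    dim = len(encoder_states[0])
    context = [
        sum(w * enc[i] for w, enc in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


# Toy input: three encoder states of dimension 2 (made-up values).
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [0.0, 2.0]
weights, context = attend(decoder_state, encoder_states)
```

In this toy input the second and third encoder states align best with the decoder state, so they receive most of the weight, while the first contributes little to the context vector.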

Key-Value Memory Systems

Key-Value memory systems store information as pairs where keys are used to look up values by similarity rather than exact match.

#key-value memory
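
The difference from an exact-match dictionary can be illustrated with a small sketch. It assumes cosine similarity as the matching function and a hard, top-1 read that returns the single best value. Real systems often use a soft read that blends all values by similarity instead. The names `cosine`, `lookup`, and `memory` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def lookup(query, memory):
    """Return the value whose key is most similar to the query.

    Assumption: a hard (top-1) read; the query need not equal any key.
    """
    _best_key, best_value = max(memory, key=lambda kv: cosine(query, kv[0]))
    return best_value


# Toy memory of (key vector, value) pairs with made-up contents.
memory = [
    ([1.0, 0.0], "cat"),
    ([0.0, 1.0], "dog"),
]

# Neither query matches a stored key exactly, yet both resolve.
near_cat = lookup([0.9, 0.1], memory)
near_dog = lookup([0.2, 0.8], memory)
```

A regular hash map would miss on both queries, since neither vector is stored as a key; the similarity-based read still returns the closest entry.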

Concepts (11)

Sequence-to-Sequence with Attention

Key-Value Memory Systems

In-Context Learning Theory

Positional Encoding Mathematics

Efficient Attention Mechanisms

Self-Attention as Graph Neural Network

Multi-Head Attention

Scaled Dot-Product Attention

Positional Encoding Theory

Transformer Theory

Attention Mechanism Theory