Concepts (172)


Transformer Expressiveness

Transformer expressiveness studies what kinds of sequence-to-sequence mappings a Transformer can represent or approximate.
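As a small illustration of what "sequence-to-sequence mapping" means here, the sketch below implements one single-head self-attention layer in NumPy and checks a well-known structural property: without positional encodings, the layer is permutation-equivariant, so reordering the input tokens reorders the outputs in the same way. The function names, the sizes `n` and `d`, and the random weights are assumptions chosen for the example, not anything defined on this page.

```python
import numpy as np

def softmax(z, axis=-1):
    # Subtract the row maximum for numerical stability.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    # X has shape (sequence_length, d); the output has the same shape.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    return softmax(scores, axis=-1) @ V

rng = np.random.default_rng(0)
n, d = 6, 4                      # hypothetical sequence length and width
X = rng.normal(size=(n, d))
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))

perm = rng.permutation(n)
out = self_attention(X, Wq, Wk, Wv)
out_perm = self_attention(X[perm], Wq, Wk, Wv)

# Permuting the input rows permutes the output rows identically.
equivariant = np.allclose(out_perm, out[perm])
```

This is one reason positional encodings matter for expressiveness: without them, a stack of such layers cannot distinguish orderings of the same set of tokens.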

#transformer expressiveness #universal approximation #self-attention (+12 more)

📚 Theory · Advanced

Feature Learning vs Kernel Regime

The kernel (lazy) regime keeps neural network parameters close to their initialization, making training equivalent to kernel regression with a fixed kernel such as the Neural Tangent Kernel (NTK).
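The equivalence with kernel regression can be sketched numerically. The example below builds a small two-layer tanh network, computes its empirical NTK at initialization from the parameter Jacobian, and evaluates the kernel-regression predictor that a linearized network would converge to under gradient flow on squared loss. The architecture, the 1/sqrt(width) scaling, the `ridge` stabilizer, and all function names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def forward(params, X):
    # f(x) = sum_j a_j * tanh(w_j . x) / sqrt(width)
    W, a = params
    return np.tanh(X @ W.T) @ a / np.sqrt(W.shape[0])

def jacobian(params, X):
    # Jacobian of the scalar output with respect to all parameters (W, a).
    W, a = params
    m = W.shape[0]
    H = np.tanh(X @ W.T)                      # (n, m)
    dH = 1.0 - H ** 2                         # tanh derivative, (n, m)
    J_a = H / np.sqrt(m)                      # d f / d a_j
    J_W = (dH * a)[:, :, None] * X[:, None, :] / np.sqrt(m)  # d f / d W_jk
    return np.concatenate([J_W.reshape(len(X), -1), J_a], axis=1)

def empirical_ntk(params, X1, X2):
    # K(x, x') = <grad_theta f(x), grad_theta f(x')> at the given parameters.
    return jacobian(params, X1) @ jacobian(params, X2).T

def ntk_regression(params, X_train, y_train, X_test, ridge=1e-8):
    # Prediction of the linearized model after training to convergence:
    # f0(x) + K(x, X) K(X, X)^{-1} (y - f0(X)).
    K_tt = empirical_ntk(params, X_train, X_train)
    K_st = empirical_ntk(params, X_test, X_train)
    resid = y_train - forward(params, X_train)
    alpha = np.linalg.solve(K_tt + ridge * np.eye(len(X_train)), resid)
    return forward(params, X_test) + K_st @ alpha

width, dim = 512, 3                           # hypothetical sizes
params = (rng.normal(size=(width, dim)), rng.normal(size=width))
X_train = rng.normal(size=(5, dim))
y_train = np.sin(X_train[:, 0])
K = empirical_ntk(params, X_train, X_train)
fit = ntk_regression(params, X_train, y_train, X_train)
```

Here the kernel is fixed at initialization and never updated, which is exactly what distinguishes the lazy regime from feature learning, where the Jacobian (and hence the kernel) changes during training.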

#neural tangent kernel



Transformer Expressiveness

Feature Learning vs Kernel Regime

Mean Field Theory of Neural Networks

Information Bottleneck in Deep Learning

Generalization Bounds for Deep Learning

Neural Tangent Kernel (NTK)

Langevin Dynamics & Score-Based Sampling

Hamiltonian Monte Carlo (HMC)

Spectral Convolution on Graphs

Random Matrix Theory in High-Dimensional Statistics

Spectral Analysis of Neural Networks

Free Probability Theory