Concepts12

Groups

Feature Learning vs Kernel Regime

The kernel (lazy) regime keeps neural network parameters close to their initialization, making training equivalent to kernel regression with a fixed kernel such as the Neural Tangent Kernel (NTK).

#neural tangent kernel#kernel ridge regression#lazy training+12

📚TheoryIntermediate

Grokking & Delayed Generalization

Grokking is when a model suddenly starts to generalize well long after it has already memorized the training set.

#grokking

Concepts12

Feature Learning vs Kernel Regime

Grokking & Delayed Generalization

Mean Field Theory of Neural Networks

Information Bottleneck in Deep Learning

Generalization Bounds for Deep Learning

Implicit Bias of Gradient Descent

Lottery Ticket Hypothesis

Double Descent Phenomenon

Neural Tangent Kernel (NTK)

Depth vs Width Tradeoffs

Scaling Laws

Universal Approximation Theorem