Concepts11

Groups

Softmax & Temperature Scaling

Softmax turns arbitrary real-valued scores (logits) into probabilities that sum to one.

Feature Learning vs Kernel Regime

The kernel (lazy) regime keeps neural network parameters close to their initialization, making training equivalent to kernel regression with a fixed kernel such as the Neural Tangent Kernel (NTK).

#neural tangent kernel

Concepts11

Softmax & Temperature Scaling

Feature Learning vs Kernel Regime

Neural Tangent Kernel (NTK)

Smooth Manifolds & Tangent Spaces

Lagrange Multipliers & Constrained Optimization

Implicit Differentiation & Implicit Function Theorem

Multivariable Chain Rule

Partial Derivatives

Matrix Calculus Fundamentals

Matrix Calculus

Determinant