Concepts18

Groups

Sharpness-Aware Minimization (SAM)

Sharpness-Aware Minimization (SAM) trains models to perform well even when their weights are slightly perturbed, seeking flatter minima that generalize better.

#sharpness-aware minimization#sam optimizer#robust optimization+11

∑MathIntermediate

Cross-Entropy Loss

Cross-entropy loss measures how well predicted probabilities match the true labels by penalizing confident wrong predictions heavily.

#cross-entropy

1 2

Concepts18

Sharpness-Aware Minimization (SAM)

Cross-Entropy Loss

RLHF Mathematics

Transfer Learning Theory

Elastic Net Regularization

L2 Regularization (Ridge/Weight Decay)

Grokking & Delayed Generalization

Hamiltonian Monte Carlo (HMC)

Natural Gradient Method

Cross-Entropy

Maximum A Posteriori (MAP) Estimation

Maximum Likelihood Estimation (MLE)