How I Study AI - Learn AI Papers & Lectures the Easy Way

Concepts (14)

Groups

๐Ÿ“Linear Algebra15๐Ÿ“ˆCalculus & Differentiation10๐ŸŽฏOptimization14๐ŸŽฒProbability Theory12๐Ÿ“ŠStatistics for ML9๐Ÿ“กInformation Theory10๐Ÿ”บConvex Optimization7๐Ÿ”ขNumerical Methods6๐Ÿ•ธGraph Theory for Deep Learning6๐Ÿ”ตTopology for ML5๐ŸŒDifferential Geometry6โˆžMeasure Theory & Functional Analysis6๐ŸŽฐRandom Matrix Theory5๐ŸŒŠFourier Analysis & Signal Processing9๐ŸŽฐSampling & Monte Carlo Methods10๐Ÿง Deep Learning Theory12๐Ÿ›ก๏ธRegularization Theory11๐Ÿ‘๏ธAttention & Transformer Theory10๐ŸŽจGenerative Model Theory11๐Ÿ”ฎRepresentation Learning10๐ŸŽฎReinforcement Learning Mathematics9๐Ÿ”„Variational Methods8๐Ÿ“‰Loss Functions & Objectives10โฑ๏ธSequence & Temporal Models8๐Ÿ’ŽGeometric Deep Learning8

โš™๏ธAlgorithmIntermediate

Sharpness-Aware Minimization (SAM)

Sharpness-Aware Minimization (SAM) trains models to perform well even when their weights are slightly perturbed, seeking flatter minima that generalize better.

#sharpness-aware minimization · #sam optimizer · #robust optimization · +11
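
A minimal NumPy sketch of the two-step SAM update, assuming a toy 2-D loss and illustrative values for the learning rate and neighborhood radius rho (none of these come from the card):

```python
import numpy as np

def loss(w):
    # Toy 2-D loss, invented for illustration.
    return 0.5 * w[0] ** 2 + np.sin(3.0 * w[1]) ** 2

def grad(w, h=1e-6):
    # Central-difference gradient; a real implementation would use autograd.
    g = np.zeros_like(w)
    for i in range(len(w)):
        d = np.zeros_like(w)
        d[i] = h
        g[i] = (loss(w + d) - loss(w - d)) / (2 * h)
    return g

def sam_step(w, lr=0.1, rho=0.05):
    g = grad(w)
    # Step 1: ascend to the (approximate) worst point in an L2 ball of radius rho.
    eps = rho * g / (np.linalg.norm(g) + 1e-12)
    # Step 2: descend using the gradient taken at the perturbed weights.
    return w - lr * grad(w + eps)

w = np.array([1.0, 0.7])
for _ in range(100):
    w = sam_step(w)
print(w, loss(w))  # settles into a low, locally flat region of the toy loss
```
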
∑ Math · Intermediate

Cross-Entropy Loss

Cross-entropy loss measures how well predicted probabilities match the true labels by penalizing confident wrong predictions heavily.

#cross-entropy · #logistic regression · #binary cross-entropy · #softmax · +11
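
A short NumPy sketch of softmax cross-entropy on hypothetical logits, showing how a confident wrong prediction costs far more than a confident right one:

```python
import numpy as np

def softmax(logits):
    z = logits - logits.max()  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def cross_entropy(logits, true_class):
    # CE = -log p(true class): near 0 when confident and right,
    # large when confident and wrong.
    p = softmax(logits)
    return -np.log(p[true_class] + 1e-12)

logits = np.array([2.0, 0.5, -1.0])  # hypothetical model outputs
print(cross_entropy(logits, 0))      # confident and correct: small loss
print(cross_entropy(logits, 2))      # confident and wrong: large loss
```
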
📚 Theory · Intermediate

RLHF Mathematics

RLHF turns human preferences between two model outputs into training signals using a probabilistic model of choice.

#rlhf · #bradley-terry · #pairwise comparisons · +11
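
A minimal sketch of the Bradley-Terry preference loss behind RLHF reward-model training; the scalar rewards below are hypothetical stand-ins for reward-model outputs:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def preference_loss(r_chosen, r_rejected):
    # Bradley-Terry: P(chosen beats rejected) = sigmoid(r_chosen - r_rejected).
    # Training maximizes the log-probability of the human's actual choice.
    return -np.log(sigmoid(r_chosen - r_rejected) + 1e-12)

print(preference_loss(1.5, -0.5))  # reward model agrees with the human: low loss
print(preference_loss(-0.5, 1.5))  # reward model disagrees: high loss
```
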
∑ Math · Intermediate

Elastic Net Regularization

Elastic Net regularization combines L1 (Lasso) and L2 (Ridge) penalties to produce models that are both sparse and stable.

#elastic net · #lasso · #ridge regression · +12
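
A small NumPy sketch of one proximal-gradient step under an Elastic Net penalty; the weights, step size, and penalty strengths are illustrative, not from the card:

```python
import numpy as np

def soft_threshold(w, t):
    # Proximal operator of the L1 term: shrinks weights toward exactly zero.
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)

def elastic_net_step(w, grad_loss, lr=0.1, l1=0.5, l2=0.5):
    # Gradient step on the smooth part (data loss plus the L2 ridge term),
    # then a proximal step on the non-smooth L1 lasso term.
    w = w - lr * (grad_loss + l2 * w)
    return soft_threshold(w, lr * l1)

w = np.array([1.0, -0.03, 0.8])
print(elastic_net_step(w, np.zeros(3)))  # tiny weight snaps to zero, others shrink
```
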
∑ Math · Intermediate

L2 Regularization (Ridge/Weight Decay)

L2 regularization (also called ridge or weight decay) adds a penalty proportional to the sum of squared weights to discourage large parameters.

#l2 regularization · #ridge regression · #weight decay · +12
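
A one-function sketch of the weight-decay reading of L2 regularization, assuming plain SGD and illustrative hyperparameters:

```python
import numpy as np

def sgd_weight_decay_step(w, grad_loss, lr=0.1, wd=0.01):
    # Adding (wd/2) * ||w||^2 to the loss contributes wd * w to the gradient,
    # which is identical to shrinking every weight by (1 - lr * wd) each step.
    return (1 - lr * wd) * w - lr * grad_loss

w = np.array([1.0, -2.0])
print(sgd_weight_decay_step(w, np.zeros_like(w)))  # pure decay: weights shrink
```
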
📚 Theory · Intermediate

Grokking & Delayed Generalization

Grokking is the phenomenon in which a model suddenly starts to generalize well long after it has memorized the training set.

#grokking · #delayed generalization · #weight decay · +12
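
A tiny helper, run here on hypothetical accuracy logs, that measures the gap between memorization and generalization; real grokking runs show this gap over many thousands of steps:

```python
import numpy as np

def grokking_delay(train_acc, val_acc, threshold=0.99):
    # First steps at which train and validation accuracy cross the threshold;
    # a large gap between the two is the delayed-generalization signature.
    t_fit = int(np.argmax(np.asarray(train_acc) >= threshold))
    t_gen = int(np.argmax(np.asarray(val_acc) >= threshold))
    return t_gen - t_fit

# Invented logs: the model memorizes at step 2 but only generalizes at step 8.
train = [0.5, 0.8, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0]
val   = [0.1, 0.1, 0.1, 0.2, 0.2, 0.3, 0.5, 0.8, 1.0, 1.0]
print(grokking_delay(train, val))  # 6 steps of delay
```
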
📚 Theory · Intermediate

Cross-Entropy

Cross-entropy measures how well a proposed distribution Q predicts outcomes actually generated by a true distribution P.

#cross-entropy · #entropy · #kl divergence · +12
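
A NumPy sketch of the information-theoretic definitions, numerically checking the identity H(P, Q) = H(P) + KL(P‖Q) on two invented distributions:

```python
import numpy as np

def entropy(p):
    p = np.asarray(p)
    return -np.sum(p * np.log(p + 1e-12))

def cross_entropy(p, q):
    # H(P, Q) = -sum_x P(x) log Q(x): the expected code length when events
    # drawn from P are encoded with a code optimized for Q.
    return -np.sum(np.asarray(p) * np.log(np.asarray(q) + 1e-12))

p = [0.7, 0.2, 0.1]  # "true" distribution
q = [0.5, 0.3, 0.2]  # proposed distribution
# H(P, Q) >= H(P), and the excess is exactly KL(P || Q).
print(cross_entropy(p, q), entropy(p), cross_entropy(p, q) - entropy(p))
```
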
∑ Math · Intermediate

Maximum A Posteriori (MAP) Estimation

Maximum A Posteriori (MAP) estimation chooses the parameter value with the highest posterior probability after seeing data.

#map estimation · #posterior mode · #bayesian inference · +12
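
A minimal sketch of MAP estimation for a coin's bias under an assumed Beta(2, 2) prior, where the posterior mode has a closed form:

```python
def map_bernoulli(heads, flips, a=2.0, b=2.0):
    # With a Beta(a, b) prior, the posterior over the bias is
    # Beta(heads + a, tails + b), whose mode is:
    return (heads + a - 1) / (flips + a + b - 2)

# 3 heads in 4 flips: the MLE would say 0.75; the prior pulls toward 0.5.
print(map_bernoulli(3, 4))  # ~0.667
```
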
∑ Math · Intermediate

Maximum Likelihood Estimation (MLE)

Maximum Likelihood Estimation (MLE) chooses parameters that make the observed data most probable under a chosen model.

#maximum likelihood · #log-likelihood · #bernoulli mle · +12
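
A short sketch that recovers the Bernoulli MLE by grid search over the log-likelihood of an invented four-flip dataset:

```python
import numpy as np

def log_likelihood(theta, data):
    # Sum of log Bernoulli(theta) probabilities of the observed flips (1 = heads).
    data = np.asarray(data)
    return np.sum(data * np.log(theta) + (1 - data) * np.log(1 - theta))

data = [1, 1, 1, 0]                  # 3 heads, 1 tail
grid = np.linspace(0.01, 0.99, 99)
mle = grid[np.argmax([log_likelihood(t, data) for t in grid])]
print(mle)  # 0.75 = heads / flips, matching the closed-form Bernoulli MLE
```
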
📚 Theory · Intermediate

Loss Landscape Analysis

A loss landscape is the โ€œterrainโ€ of a modelโ€™s loss as you move through parameter space; valleys are good solutions and peaks are bad ones.

#loss landscape · #sharpness · #hessian eigenvalues · +12
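
A toy 1-D sketch contrasting a sharp and a flat valley with a crude neighborhood-based sharpness proxy; the loss function is invented, and real analyses use Hessian eigenvalues or random 2-D slices:

```python
import numpy as np

def loss(w):
    # Invented landscape: a sharp valley at w = 0, a flatter one near w = 3.
    return np.minimum(50.0 * w ** 2, (w - 3.0) ** 2 + 0.1)

def sharpness(w_star, radius=0.5, n=101):
    # Worst-case loss increase in a small neighborhood of a minimum.
    ws = np.linspace(w_star - radius, w_star + radius, n)
    return loss(ws).max() - loss(w_star)

print(sharpness(0.0))  # sharp valley: loss climbs steeply nearby
print(sharpness(3.0))  # flat valley: loss barely moves nearby
```
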
โš™๏ธAlgorithmIntermediate

Momentum Methods

Momentum methods add an exponentially weighted memory of past gradients to make descent steps smoother and faster, especially in ravines and ill-conditioned problems.

#momentum · #heavy-ball · #polyak momentum · +12
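
A minimal heavy-ball sketch on an invented ill-conditioned quadratic ravine; the exponentially weighted gradient memory damps oscillation along the steep axis while accelerating progress along the shallow one:

```python
import numpy as np

def grad(w):
    # Gradient of an ill-conditioned quadratic: steep in w[0], shallow in w[1].
    return np.array([10.0 * w[0], 0.1 * w[1]])

def momentum_descent(w, steps=200, lr=0.05, beta=0.9):
    v = np.zeros_like(w)
    for _ in range(steps):
        v = beta * v + grad(w)  # exponentially weighted memory of past gradients
        w = w - lr * v          # heavy-ball update
    return w

print(momentum_descent(np.array([1.0, 1.0])))  # both coordinates driven near 0
```
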
โš™๏ธAlgorithmIntermediate

Stochastic Gradient Descent (SGD)

Stochastic Gradient Descent (SGD) updates model parameters using small random subsets (mini-batches) of data, making learning faster and more memory-efficient.

#stochastic gradient descent · #mini-batch · #random shuffling · +12
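
A self-contained mini-batch SGD sketch on synthetic linear-regression data; the batch size, learning rate, and epoch count are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.01 * rng.normal(size=1000)

w = np.zeros(3)
lr, batch = 0.1, 32
for epoch in range(20):
    idx = rng.permutation(len(X))        # reshuffle the data every epoch
    for start in range(0, len(X), batch):
        b = idx[start:start + batch]     # small random mini-batch
        err = X[b] @ w - y[b]
        w -= lr * X[b].T @ err / len(b)  # gradient of the mean squared error
print(w)  # recovers something close to true_w
```
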