Concepts (20)

Rényi Entropy & Divergence

Rényi entropy generalizes Shannon entropy by measuring uncertainty with a tunable emphasis on common versus rare outcomes.

#renyi entropy #renyi divergence #shannon entropy (+12 more)
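
The "tunable emphasis" in the summary above can be made concrete with a small sketch. It assumes the standard discrete definition H_α(p) = log(Σ p_i^α) / (1 − α) in nats, with the Shannon case recovered as α → 1; the function name `renyi_entropy` is illustrative and not taken from any particular library.

```python
import math

def renyi_entropy(p, alpha):
    """Rényi entropy of order alpha (in nats) for a discrete distribution p.

    Assumes p is a sequence of probabilities summing to 1 and alpha >= 0.
    """
    if alpha < 0:
        raise ValueError("alpha must be non-negative")
    probs = [x for x in p if x > 0]  # zero-probability outcomes contribute nothing
    if math.isclose(alpha, 1.0):
        # Limit alpha -> 1 is the Shannon entropy.
        return -sum(x * math.log(x) for x in probs)
    if math.isinf(alpha):
        # Limit alpha -> infinity is the min-entropy, set by the most likely outcome.
        return -math.log(max(probs))
    return math.log(sum(x ** alpha for x in probs)) / (1.0 - alpha)

p = [0.5, 0.25, 0.25]
# Small alpha weights rare outcomes more; large alpha weights common ones more.
for alpha in (0.0, 0.5, 1.0, 2.0, math.inf):
    print(alpha, renyi_entropy(p, alpha))
```

For a fixed distribution the printed values do not increase as α grows, and for a uniform distribution every order gives the same value, log n.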

⚙️ Algorithm, Intermediate

Mixed Precision Training

Mixed precision training stores and computes tensors in low precision (FP16/BF16) for speed and memory savings while keeping a master copy of weights in FP32 for accurate updates.

#mixed precision
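
To illustrate why the summary above keeps an FP32 master copy, here is a toy sketch using only the Python standard library. It emulates FP16 and FP32 rounding with `struct`, and the helper names (`to_fp16`, `to_fp32`, `train`) and the constant gradient are assumptions made for the demonstration, not part of any training framework.

```python
import struct

def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

def to_fp32(x):
    """Round a Python float to the nearest IEEE single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

def train(steps=100, lr=1e-2, grad=1e-2, use_master=True):
    """Run toy gradient descent on one weight and return its final FP16 value.

    The gradient is a fixed constant (a hypothetical stand-in for backprop),
    so each update is about 1e-4, smaller than FP16 can resolve near 1.0.
    """
    master = to_fp32(1.0)   # FP32 master copy of the weight
    w16 = to_fp16(master)   # low-precision working copy
    for _ in range(steps):
        g16 = to_fp16(grad)  # gradient as it would come out of FP16 compute
        if use_master:
            # Accumulate the update in FP32, then refresh the FP16 copy.
            master = to_fp32(master - lr * g16)
            w16 = to_fp16(master)
        else:
            # Update the FP16 weight directly; tiny updates round away.
            w16 = to_fp16(w16 - to_fp16(lr * g16))
    return w16

print("FP16 only:       ", train(use_master=False))
print("with FP32 master:", train(use_master=True))
```

Without the master copy the weight never leaves 1.0, because each update is below half the FP16 spacing at that magnitude. With the master copy the small updates accumulate in FP32 and the weight moves toward 0.99.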

Concepts (20)

Rényi Entropy & Divergence

Mixed Precision Training

Orthogonal & Unitary Matrices

State Space Models (SSM)

Focal Loss

Cross-Entropy Loss

Softmax & Temperature Scaling

Multi-Head Attention

Scaled Dot-Product Attention

Label Smoothing

Layer Normalization

Matrix Factorizations (Numerical)