Concepts95

Groups

Autoregressive Models

Autoregressive (AR) models represent a joint distribution by multiplying conditional probabilities in a fixed order, using the chain rule of probability.

#autoregressive#ar model#n-gram+11

📚TheoryIntermediate

Maximum Likelihood & Generative Models

Maximum Likelihood Estimation (MLE) picks parameters that make the observed data most probable under a chosen probabilistic model.

#maximum likelihood

1 2 3 4 5

Concepts95

Autoregressive Models

Maximum Likelihood & Generative Models

Mixture of Experts (MoE)

Key-Value Memory Systems

Self-Attention as Graph Neural Network

Multi-Head Attention

Scaled Dot-Product Attention

Stochastic Depth

Spectral Regularization

Early Stopping

Label Smoothing

Data Augmentation Theory