๐ŸŽ“How I Study AIHISA
๐Ÿ“–Read
๐Ÿ“„Papers๐Ÿ“ฐBlogs๐ŸŽฌCourses
๐Ÿ’กLearn
๐Ÿ›ค๏ธPaths๐Ÿ“šTopics๐Ÿ’กConcepts๐ŸŽดShorts
๐ŸŽฏPractice
๐Ÿ“Daily Log๐ŸŽฏPrompts๐Ÿง Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Concepts57

Groups

๐Ÿ“Linear Algebra15๐Ÿ“ˆCalculus & Differentiation10๐ŸŽฏOptimization14๐ŸŽฒProbability Theory12๐Ÿ“ŠStatistics for ML9๐Ÿ“กInformation Theory10๐Ÿ”บConvex Optimization7๐Ÿ”ขNumerical Methods6๐Ÿ•ธGraph Theory for Deep Learning6๐Ÿ”ตTopology for ML5๐ŸŒDifferential Geometry6โˆžMeasure Theory & Functional Analysis6๐ŸŽฐRandom Matrix Theory5๐ŸŒŠFourier Analysis & Signal Processing9๐ŸŽฐSampling & Monte Carlo Methods10๐Ÿง Deep Learning Theory12๐Ÿ›ก๏ธRegularization Theory11๐Ÿ‘๏ธAttention & Transformer Theory10๐ŸŽจGenerative Model Theory11๐Ÿ”ฎRepresentation Learning10๐ŸŽฎReinforcement Learning Mathematics9๐Ÿ”„Variational Methods8๐Ÿ“‰Loss Functions & Objectives10โฑ๏ธSequence & Temporal Models8๐Ÿ’ŽGeometric Deep Learning8

Category

๐Ÿ”ทAllโˆ‘Mathโš™๏ธAlgo๐Ÿ—‚๏ธDS๐Ÿ“šTheory

Level

AllBeginnerIntermediate
๐Ÿ“šTheoryAdvanced

Streaming Algorithm Theory

Streaming algorithms process massive data one pass at a time using sublinearโ€”often polylogarithmicโ€”memory.

#streaming algorithms#count-min sketch#misra-gries+12
๐Ÿ“šTheoryAdvanced

Distributed Algorithm Theory

Distributed algorithm theory studies how many independent computers cooperate correctly and efficiently despite delays and failures.

#distributed algorithms
12345
Advanced
#message passing
#synchronous rounds
+12
๐Ÿ“šTheoryAdvanced

Algorithmic Information Theory

Algorithmic Information Theory studies information content via the shortest programs that generate data, rather than via average-case probabilities.

#kolmogorov complexity#algorithmic probability#solomonoff induction+11
๐Ÿ“šTheoryAdvanced

Optimal Transport Theory

Optimal Transport (OT) formalizes the cheapest way to move one probability distribution into another given a cost to move mass.

#optimal transport#wasserstein distance#kantorovich+12
๐Ÿ“šTheoryAdvanced

Diffusion Models Theory

Diffusion models learn to reverse a simple noising process by estimating the score (the gradient of the log density) of data at different noise levels.

#diffusion models#ddpm#score matching+12
๐Ÿ“šTheoryAdvanced

Variational Inference Theory

Variational Inference (VI) replaces an intractable posterior with a simpler distribution and optimizes it by minimizing KL divergence, which is equivalent to maximizing the ELBO.

#variational inference#elbo#kl divergence+12
๐Ÿ“šTheoryAdvanced

P vs NP Problem

P vs NP asks whether every problem whose solutions can be verified quickly can also be solved quickly.

#p vs np#np-complete#np-hard+12
๐Ÿ“šTheoryAdvanced

GAN Theory

Generative Adversarial Networks (GANs) set up a two-player game where a generator tries to make fake samples that look real while a discriminator tries to tell real from fake.

#gan minimax#wasserstein gan#js divergence+11
๐Ÿ“šTheoryAdvanced

Representation Learning Theory

Representation learning aims to automatically discover features that make downstream tasks easy, often without human-provided labels.

#representation learning#contrastive learning#infonce+12
๐Ÿ“šTheoryAdvanced

Information Bottleneck Theory

Information Bottleneck (IB) studies how to compress an input X into a representation Z that still preserves what is needed to predict Y.

#information bottleneck#mutual information#variational information bottleneck+12
๐Ÿ“šTheoryAdvanced

Policy Gradient Theorem

The policy gradient theorem tells us how to push a stochastic policyโ€™s parameters to increase expected return by following the gradient of expected rewards.

#policy gradient#reinforce#actor-critic+11
๐Ÿ“šTheoryAdvanced

Transformer Theory

Transformers map sequences to sequences using layers of self-attention and feed-forward networks wrapped with residual connections and LayerNorm.

#transformer#self-attention#positional encoding+12