Concepts7

Groups

Depth vs Width Tradeoffs

Depth adds compositional power: stacking layers lets neural networks represent functions with many repeated patterns using far fewer neurons than a single wide layer.

#depth vs width#relu#piecewise linear+12

📚TheoryIntermediate

Empirical Risk Minimization

Empirical Risk Minimization (ERM) chooses a model that minimizes the average loss on the training data.

#empirical risk minimization

Concepts7

Depth vs Width Tradeoffs

Empirical Risk Minimization

Neural Network Expressivity

Statistical Learning Theory

PAC Learning

VC Dimension

Rademacher Complexity