Groups
Category
Empirical Risk Minimization (ERM) chooses a model that minimizes the average loss on the training data.
Gradient descent is a simple, repeatable way to move downhill on a loss surface by stepping in the opposite direction of the gradient.