Groups
Category
In underdetermined linear systems (more variables than equations), gradient descent started at zero converges to the minimum Euclidean norm solution without any explicit regularizer.
Gradient clipping limits how large gradient values or their overall magnitude can become during optimization to prevent exploding updates.