Groups
Category
Natural gradient scales the ordinary gradient by the inverse Fisher information matrix to account for the geometry of probability distributions.
The policy gradient theorem tells us how to push a stochastic policyโs parameters to increase expected return by following the gradient of expected rewards.