Groups
Category
Value function approximation replaces a huge table of values with a small set of parameters that can generalize across similar states.
Bellman equations express how the value of a state or action equals immediate reward plus discounted value of what follows.