Groups
Category
Bellman equations express how the value of a state or action equals immediate reward plus discounted value of what follows.
Reinforcement Learning (RL) studies how an agent learns to act in an environment to maximize long-term cumulative reward.