Reinforcement Learning (RL) studies how an agent learns to act in an environment to maximize long-term cumulative reward.
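The agent-environment interaction described above can be sketched as a simple loop. This is a minimal illustration, not a standard API: the `Corridor` environment, the `run_episode` helper, the `always_right` policy, and the reward values are all hypothetical names and numbers chosen for the example. No learning happens here; the sketch only shows how actions, rewards, and the cumulative reward fit together.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class Corridor:
    """Hypothetical toy environment: walk from cell 0 to the last cell."""
    length: int = 5
    position: int = 0

    def reset(self) -> int:
        # Start every episode at the left end of the corridor.
        self.position = 0
        return self.position

    def step(self, action: int) -> Tuple[int, float, bool]:
        # action is -1 (left) or +1 (right); the position is clamped to the corridor.
        self.position = max(0, min(self.length - 1, self.position + action))
        done = self.position == self.length - 1
        # Assumed reward scheme: small cost per step, bonus on reaching the goal.
        reward = 1.0 if done else -0.1
        return self.position, reward, done


def run_episode(env: Corridor, policy: Callable[[int], int], max_steps: int = 100) -> float:
    """Run one episode and return the (undiscounted) cumulative reward."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)                  # the agent chooses an action
        state, reward, done = env.step(action)  # the environment responds
        total_reward += reward                  # accumulate reward over the episode
        if done:
            break
    return total_reward


def always_right(state: int) -> int:
    """A fixed, non-learning policy that always moves toward the goal."""
    return 1


if __name__ == "__main__":
    print(run_episode(Corridor(), always_right))
```

An RL algorithm would replace `always_right` with a policy that is adjusted from experience so that the value returned by `run_episode` grows over time.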