This paper teaches AI agents to decide when to explore for more information and when to act right away.
The paper identifies a very small, specialized group of neurons inside large language models (LLMs) that functions much like the reward system in the human brain.