Sparse Reward Subsystem in Large Language Models
IntermediateGuowei Xu, Mert Yuksekgonul et al.Feb 1arXiv
The paper discovers a tiny, special group of neurons inside large language models (LLMs) that act like a reward system in the human brain.
#value neurons#dopamine neurons#reward prediction error