Meta-RL Induces Exploration in Language Agents
IntermediateYulun Jiang, Liangze Jiang et al.Dec 18arXiv
This paper introduces LAMER, a Meta-RL training framework that teaches language agents to explore first and then use what they learned to solve tasks faster.
#Meta-Reinforcement Learning#Language Agents#Exploration vs Exploitation