The paper teaches large language models to learn from detailed feedback (like error messages) instead of only a simple pass/fail score.
This paper shows a simple way for AI models to keep learning new things without forgetting what they already know.
This paper explains how to turn large language models (LLMs) from quiet students that only answer questions into active agents that can plan, act, and learn over time.
The paper introduces Nested Learning, a new way to build AI that learns in layers (like Russian dolls), so each part can update at its own speed and remember different things.
This paper introduces LAMER, a Meta-RL training framework that teaches language agents to explore first and then use what they learned to solve tasks faster.