Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
IntermediateZhiyuan Hu, Yunhai Hu et al.Jan 14arXiv
This paper introduces MATTRL, a way for multiple AI agents to learn from their own conversations at test time using short, reusable text notes instead of retraining their weights.
#multi-agent systems#test-time reinforcement learning#experience retrieval