Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
BeginnerZhihan Liu, Lin Guan et al.Jan 26arXiv
LLM agents are usually trained in a few worlds but asked to work in many different, unseen worlds, which often hurts their performance.
#cross-domain generalization#state information richness#planning complexity