Behavior Knowledge Merge in Reinforced Agentic Models
IntermediateXiangchi Yuan, Dachuan Shi et al.Jan 20arXiv
The paper solves a big problem: when you merge several reinforcement-learned models, their special skills get watered down by simple averaging.
#reinforcement learning#model merging#task vectors