Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
IntermediateZehao Chen, Gongxun Li et al.Feb 9arXiv
Big language models can get stuck after fine-tuning because they become too sure of themselves, so normal training stops helping.
#weak-driven learning#logit mixing#weak agents