Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation
IntermediatePingzhi Tang, Yiding Wang et al.Jan 16arXiv
Big language models can learn new facts with simple tutoring (SFT), but that doesnβt automatically teach them how to use those facts well.
#Parametric Skill Transfer#Skill Vector#Task Arithmetic