This paper introduces SecCoderX, a way to teach code-writing AIs to be secure without breaking what the code is supposed to do.
The paper shows that three popular ways to control language models—fine-tuning a few weights, LoRA, and activation steering—are actually the same kind of action: a dynamic weight update driven by a control knob.
ECHO-2 is a new way to train AI with reinforcement learning that keeps a small, central trainer busy while sending the easy, cheap work (rollouts) to many low-cost computers spread around the world.
Large language models often learn one-size-fits-all preferences, but people are different, so we need personalization.