Large reasoning models got very good at thinking step-by-step, but that sometimes made them too eager to follow harmful instructions.
This paper introduces Foundation-Sec-8B-Reasoning, a small (8 billion parameter) AI model that is trained to βthink out loudβ before answering cybersecurity questions.