Large reasoning models got very good at thinking step by step, but that sometimes made them too eager to follow harmful instructions.
The paper introduces DASD-4B-Thinking, a small (4B-parameter) open-source reasoning model that scores on par with much larger models on hard math, science, and coding benchmarks.