ThinkRouter teaches a model to switch how it “thinks” based on how confident it is, so it stays accurate without producing overly long reasoning.
Golden Goose turns messy internet text into clean multiple-choice puzzles that models can learn from, with answers that can be checked automatically to give rewards.
The paper introduces Entropy Sentinel, a simple way to watch how accurate an AI is by reading its “uncertainty heartbeat” during generation.
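
The summary above does not say how the “uncertainty heartbeat” is computed. As a rough sketch only, assuming the signal is the Shannon entropy of the model's next-token distribution averaged over a generation, it could look like the following. The function names and the example distributions are made up for illustration and are not taken from the paper.

```python
import math

def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token probability distribution."""
    # Zero-probability tokens contribute nothing and are skipped to avoid log(0).
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def mean_entropy(step_distributions):
    """Average per-token entropy across all generation steps."""
    entropies = [token_entropy(p) for p in step_distributions]
    return sum(entropies) / len(entropies)

# Hypothetical next-token distributions for two short generations
# (assumed values, only to show the direction of the signal).
confident = [[0.97, 0.01, 0.01, 0.01], [0.94, 0.02, 0.02, 0.02]]
uncertain = [[0.25, 0.25, 0.25, 0.25], [0.40, 0.30, 0.20, 0.10]]

# A peaked distribution has lower entropy than a flat one.
print(mean_entropy(confident) < mean_entropy(uncertain))
```

Under this assumption, a generation whose average entropy rises over time would be a hint that the model is less sure of its output. Whether that tracks accuracy as the paper claims is something only the paper's own experiments can show.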