Golden Goose turns messy internet text into clean multiple-choice puzzles that computers can learn from and get automatic rewards for.
The paper introduces Entropy Sentinel, a simple way to watch how accurate an AI is by reading its “uncertainty heartbeat” during generation.