The paper shows how to make AI think faster and smarter by planning in a hidden space instead of writing long step-by-step sentences.
SAGE is a two-agent system that automatically writes tough, multi-step search questions and checks them by actually trying to solve them.
RL-trained search agents often sound confident even when they donβt know, which can mislead people.