Small AI models often stumble when a tool call fails and then get stuck repeating bad calls instead of fixing the mistake.
RL-trained search agents often sound confident even when they don't know, which can mislead people.