The paper introduces LT-Tuning, a way for AI models to “think silently” using special hidden tokens instead of writing every step out loud.
Small AI models often stumble when a tool call fails: they get stuck repeating the same bad call instead of fixing the mistake.