Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors
Beginner · Zhiwei Zhang, Fei Zhao et al. · Jan 22 · arXiv
Small AI models often stumble when a tool call fails: they get stuck repeating the bad call instead of fixing the mistake.
#FISSION-GRPO #error recovery #tool use