Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
BeginnerEmre Can Acikgoz, Cheng Qian et al.Feb 24arXiv
Tool-R0 teaches a language model to use software tools (like APIs) with zero human-made training data.
#self-play reinforcement learning#tool calling#function calling