ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
Intermediate | Xiaoyu Tian, Haotian Wang et al. | Jan 29 | arXiv
ASTRA is a fully automated method for training tool-using AI agents: it generates both their practice stories (trajectories) and their practice worlds (environments) without humans in the loop.
#tool-augmented agents #multi-turn decision making #verifiable environments