Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning
IntermediateJinyang Wu, Shuo Yang et al.Jan 28arXiv
SPARK is a new way to train AI agents that saves compute by exploring more only at the most important moments.
#SPARK#dynamic branching#strategic exploration