DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints
IntermediateYinger Zhang, Shutong Jiang et al.Jan 26arXiv
DeepPlanning is a new benchmark that tests whether AI can make long, realistic plans that fit time and money limits.
#long-horizon planning#agentic tool use#global constrained optimization