EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
Intermediate · Xavier Hu, Jinxiang Xia et al. · Feb 10 · arXiv
EcoGym is a new open benchmark in which LLM agents run small businesses over many days, testing whether they can plan and execute over long horizons.
#EcoGym #long-horizon planning #LLM agents