DSGym: A Holistic Framework for Evaluating and Training Data Science Agents
BeginnerFan Nie, Junlin Wang et al.Jan 22arXiv
DSGym is a unified 'gym' where AI data science agents are tested and trained by actually running code on real datasets, not just chatting about them.
#DSGym#data science agents#execution-grounded evaluation