GameDevBench: Evaluating Agentic Capabilities Through Game Development
Intermediate · Wayne Chi, Yixiong Fang et al. · Feb 11 · arXiv
GameDevBench is a new benchmark that tests whether AI agents can actually build parts of video games, rather than just write code in a single file.
#GameDevBench #Godot #multimodal agents