ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Intermediate · Jie Yang, Honglin Guo et al. · Jan 16 · arXiv
ABC-Bench is a new benchmark that checks whether AI coding agents can carry out backend work from start to finish, not just write a few lines of code.
#ABC-Bench #agentic backend coding #end-to-end API testing