This paper introduces TAM-Eval, a new way to test how well AI models can create, fix, and update unit tests for real software projects.
This paper introduces UCoder, a way to teach a code-generating AI to improve without using any outside datasets, not even unlabeled code.