Classroom Final Exam: An Instructor-Tested Reasoning Benchmark
IntermediateChongyang Gao, Diji Yang et al.Feb 23arXiv
CFE-BENCH is a new, teacher-verified "Classroom Final Exam" for AI that uses real college STEM problems to test deep, step-by-step reasoning.
#CFE-BENCH#variable-based verification#reasoning flow