LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation
BeginnerZhiling Yan, Dingjie Song et al.Feb 10arXiv
LiveMedBench is a new, always-updating test for medical AIs that keeps test questions safely separated from training data to avoid cheating by memorization.
#LiveMedBench#medical benchmark#data contamination