EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning
Intermediate | Mingyang Wei, Dehai Min et al. | Jan 6 | arXiv
EpiQAL is a new benchmark that tests how well AI models answer population-level disease questions using real research papers.
#Epidemiological reasoning #Question answering #Benchmarking LLMs