FIN-bench-v2: A Unified and Robust Benchmark Suite for Evaluating Finnish Large Language Models
Intermediate · Joona Kytöniemi, Jousia Piha et al. · Dec 15 · arXiv
FIN-bench-v2 is a large, unified set of Finnish-language tests that checks how well large language models handle skills such as reading comprehension, logical reasoning, and world knowledge.
#Finnish language models  #benchmark suite  #HuggingFace Datasets