AVMeme Exam is a new human-made test that checks whether AI can understand famous internet audio and video clips the way people do.
FIN-bench-v2 is a large, well-organized set of Finnish tests that checks how well large language models handle many skills, such as reading, logic, and world knowledge.