MMLU · active · General
Massive Multitask Language Understanding
Metric: Accuracy (higher is better)
Introduced: 2021
Superseded by: mmlu-pro
About
Tests LLMs on knowledge ranging from elementary to professional level across 57 subjects, using 15,908 four-option multiple-choice questions. Top models now score 86–90%, so the benchmark is effectively saturated; MMLU-Pro is the recommended harder successor.
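As a rough illustration of the accuracy metric, the sketch below scores multiple-choice predictions against gold answers, both overall and per subject. The function names and the `(subject, predicted, gold)` record layout are assumptions made for this example, not part of any official MMLU evaluation harness.

```python
from collections import defaultdict


def accuracy(predictions, answers):
    """Fraction of predictions that exactly match the gold answers."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


def per_subject_accuracy(records):
    """Accuracy per subject from (subject, predicted, gold) tuples.

    The record layout is hypothetical; adapt it to your own data format.
    """
    totals = defaultdict(lambda: [0, 0])  # subject -> [correct, total]
    for subject, predicted, gold in records:
        totals[subject][0] += int(predicted == gold)
        totals[subject][1] += 1
    return {subject: c / n for subject, (c, n) in totals.items()}


# Toy data, invented purely for illustration.
overall = accuracy(["A", "B", "C", "D"], ["A", "B", "D", "D"])
by_subject = per_subject_accuracy([
    ("anatomy", "A", "A"),
    ("anatomy", "B", "C"),
    ("virology", "D", "D"),
])
```

Here `overall` is the fraction correct across all questions, while `by_subject` makes it easier to see which of the subjects pull the aggregate score up or down.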