LLM Reference
MMLU · active · General

Massive Multitask Language Understanding

Metric: Accuracy (higher is better)

Introduced: 2021

Superseded by: mmlu-pro

About

Tests LLMs on undergraduate- to professional-level knowledge across 57 subjects using 15,908 multiple-choice questions. Top models have saturated the benchmark at 86–90% accuracy; MMLU-Pro is the recommended, harder successor.
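
As a rough illustration of the accuracy metric, the sketch below scores multiple-choice predictions against an answer key, both overall and per subject. The function names, the record layout, and the toy data are assumptions made up for this example; they are not real MMLU items and not the official evaluation harness.

```python
from collections import defaultdict


def accuracy(predictions, answers):
    """Fraction of questions where the predicted choice matches the answer key."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("no questions to score")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


def per_subject_accuracy(records):
    """Accuracy per subject from (subject, predicted, answer) tuples."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    for subject, predicted, answer in records:
        totals[subject] += 1
        hits[subject] += predicted == answer  # True counts as 1
    return {subject: hits[subject] / totals[subject] for subject in totals}


# Hypothetical toy records (made up for illustration, not MMLU data).
records = [
    ("astronomy", "A", "A"),
    ("astronomy", "C", "B"),
    ("law", "D", "D"),
    ("law", "B", "B"),
]

overall = accuracy([r[1] for r in records], [r[2] for r in records])
by_subject = per_subject_accuracy(records)
print(overall)     # 0.75
print(by_subject)  # {'astronomy': 0.5, 'law': 1.0}
```

Because each MMLU question offers four answer choices, random guessing yields about 25% accuracy, which is a useful baseline when reading reported scores.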