LLM ReferenceLLM Reference
SQuADactive

Stanford Question Answering Dataset

Metric: Exact Match / F1 (higher is better)Introduced: 2016

About

100,000+ reading comprehension questions on 500+ Wikipedia articles. SQuAD 2.0 (2018) added unanswerable questions. Both versions remain widely cited.