TruthfulQAactiveReasoning
TruthfulQA
Metric: TruthfulQA Score (higher is better)Introduced: 2021
About
817 questions spanning 38 categories where humans commonly hold misconceptions. Measures whether models give truthful answers or reproduce popular falsehoods.