LLM ReferenceLLM Reference
ARCactiveReasoning

AI2 Reasoning Challenge

Metric: Accuracy (higher is better)Introduced: 2018

About

Grade-school science multiple-choice questions partitioned into Easy and Challenge (hard) sets. ARC-Challenge is the standard evaluation variant.