AGIEval | active | Composite
Artificial General Intelligence Eval
Metric: Accuracy (higher is better)
Introduced: 2023
About
Evaluation suite of 20 tasks drawn from official human-centric standardized exams (such as the SAT, LSAT, GRE, and bar exam), spanning English and Chinese.