LLM Reference
Holistic

HELM (Holistic Evaluation of Language Models)

About

Comprehensive evaluation framework covering accuracy, fairness, robustness, efficiency, calibration, and safety across 30+ scenarios.