HELM (Holistic Evaluation of Language Models)
About
A comprehensive evaluation framework for language models that measures accuracy, fairness, robustness, efficiency, calibration, and safety across more than 30 scenarios.