LLM Reference
CRASS
Status: active

Counterfactual Reasoning Assessment

Metric: Accuracy (higher is better)
Introduced: 2022

About

CRASS is a counterfactual reasoning benchmark that tests whether LLMs can reason about hypothetical "what if" scenarios and their logical implications.
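Since the reported metric is accuracy, a minimal sketch of how such a score could be computed may help. This assumes a multiple-choice item format; the `Item` class, its fields, and the sample questions are hypothetical illustrations, not the benchmark's actual data or schema.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """One hypothetical multiple-choice item (format assumed for illustration)."""
    question: str
    choices: list[str]
    answer_index: int  # index of the correct choice in `choices`


def accuracy(items: list[Item], predictions: list[int]) -> float:
    """Return the fraction of items whose predicted choice index is correct."""
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")
    if not items:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(
        1 for item, pred in zip(items, predictions) if pred == item.answer_index
    )
    return correct / len(items)


# Made-up items for illustration only; not taken from the benchmark.
items = [
    Item(
        question="A glass fell onto a stone floor. "
                 "What would have happened if it had fallen onto a pillow?",
        choices=[
            "It would have shattered.",
            "It would probably have stayed intact.",
            "It would have floated.",
        ],
        answer_index=1,
    ),
    Item(
        question="A man walked to work. "
                 "What would have happened if he had driven instead?",
        choices=[
            "He would probably have arrived sooner.",
            "He would have arrived by boat.",
            "He would not have left home.",
        ],
        answer_index=0,
    ),
]

# Hypothetical model outputs: one chosen index per item.
predictions = [1, 2]
score = accuracy(items, predictions)
print(f"accuracy = {score:.2f}")
```

Here the first prediction matches and the second does not, so one of two items is scored correct. Higher values indicate better counterfactual reasoning under this scoring.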

Resources

Website, GitHub