LLM ReferenceLLM Reference
BBQ AmbigactiveSafety

BBQ Ambig

Metric: Bias Score (higher is better)Introduced: 2022

About

Ambiguous context subset of the Bias Benchmark for QA (BBQ), measuring how models respond to social bias questions when context is underspecified.

Resources

Website