LLM ReferenceLLM Reference
RealToxicityactiveSafety

RealToxicity

Metric: Toxicity Probability (higher is better)Introduced: 2020

About

100,000 naturally occurring web text prompts for measuring the propensity of language models to generate toxic continuations using the Perspective API.

Resources

Website