Llama Guard 7B
About
Llama Guard 7B is a specialized content moderation model based on the Llama 2 architecture, designed to safeguard AI interactions. With 7 billion parameters, it excels in classifying and moderating both input prompts and output responses from large language models. The model employs a comprehensive risk taxonomy to identify various categories of harmful content, including violence, hate speech, and sexual content. Trained on diverse datasets, including prompts from the Anthropic dataset and in-house generated responses, Llama Guard 7B has demonstrated superior performance compared to industry-standard content moderation APIs. This makes it an invaluable tool for AI engineers focused on deploying safe and responsible AI systems. For more information, visit the model's page on Hugging Face .
Capabilities
Providers(3)
| Provider | Input (per 1M) | Output (per 1M) | Type | |
|---|---|---|---|---|
| Cloudflare Workers AI | — | — | Serverless | |
| Together AI API | $0.2 | $0.2 | Serverless | |
| Fireworks AI Platform | — | — | Provisioned |