LLM Reference

Llama Guard 7B

About

Llama Guard 7B is a specialized content moderation model based on the Llama 2 architecture, designed to safeguard AI interactions. With 7 billion parameters, it excels in classifying and moderating both input prompts and output responses from large language models. The model employs a comprehensive risk taxonomy to identify various categories of harmful content, including violence, hate speech, and sexual content. Trained on diverse datasets, including prompts from the Anthropic dataset and in-house generated responses, Llama Guard 7B has demonstrated superior performance compared to industry-standard content moderation APIs. This makes it an invaluable tool for AI engineers focused on deploying safe and responsible AI systems. For more information, visit the model's page on Hugging Face .

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(3)

ProviderInput (per 1M)Output (per 1M)Type
Cloudflare Workers AI
Serverless
Together AI API$0.2$0.2
Serverless
Fireworks AI Platform
Provisioned

Specifications

Released2023-12-07
Parameters7B
Context2K
ArchitectureDecoder Only
Specializationgeneral