LLM Reference

Llama 2 7B Chat

About

Llama 2 7B Chat is a fine-tuned variant in Meta's Llama 2 series, optimized for conversational applications. It is an auto-regressive transformer with 7 billion parameters, trained on a diverse dataset of 2 trillion tokens. The model was then tuned with supervised fine-tuning and reinforcement learning from human feedback (RLHF) to improve its behavior in dialogue. It is reported to be competitive on helpfulness and safety with open-source alternatives and with closed-source models such as ChatGPT and PaLM. It is intended for commercial and research use, primarily in English, and is suited to building chatbots, virtual assistants, and other interactive systems. More details are available on its Hugging Face page.

Capabilities

Multimodal, Function Calling, Tool Use, JSON Mode

Providers (10)

Provider                 Input (per 1M tokens)   Output (per 1M tokens)   Type
Alibaba Cloud PAI-EAS    -                       -                        Serverless
Baseten API              -                       -                        Serverless
Replicate API            -                       -                        Serverless
Fireworks AI Platform    -                       -                        Provisioned
Azure OpenAI             $0.52                   $0.67                    Serverless, Provisioned
GCP Vertex AI            -                       -                        Serverless
Cloudflare Workers AI    -                       -                        Serverless
deepinfra API            -                       -                        Serverless
Lepton AI API            -                       -                        Serverless
Together AI API          $0.20                   $0.20                    Serverless
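
The per-1M-token prices listed above can be turned into a rough per-request cost. The sketch below is illustrative only: the function name `estimate_cost` and the example token counts are assumptions, and it covers only the two providers that list prices here. Actual billing may differ (minimum charges, rounding, provisioned capacity).

```python
# Prices in USD per 1M tokens, copied from the provider list above.
# Only providers that show a price are included.
PRICES_PER_1M = {
    "Azure OpenAI": (0.52, 0.67),      # (input, output)
    "Together AI API": (0.20, 0.20),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost in USD (hypothetical helper)."""
    input_price, output_price = PRICES_PER_1M[provider]
    # Each price applies per one million tokens, so scale the total down.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example request sizes are made up for illustration.
    for name in PRICES_PER_1M:
        cost = estimate_cost(name, input_tokens=3_000, output_tokens=1_000)
        print(f"{name}: ${cost:.6f}")
```

Input and output tokens are priced separately, so a provider with asymmetric pricing costs more for output-heavy requests than its input price alone suggests.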

Specifications

Family: Llama 2
Released: 2023-07-18
Parameters: 7B
Context: 4K
Architecture: Decoder Only
Specialization: General
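
The context figure bounds the prompt and the generated reply together. The sketch below shows the arithmetic, assuming "4K" means 4,096 tokens. The helper names are hypothetical, and the token counts would have to come from the model's own tokenizer, which is not shown here.

```python
# Assumption: the "4K" context listed above is read as 4,096 tokens.
CONTEXT_WINDOW = 4096


def max_new_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for generation once the prompt occupies the window."""
    # Clamp at zero: a prompt longer than the window leaves no room at all.
    return max(0, context_window - prompt_tokens)


def fits(prompt_tokens: int, output_tokens: int,
         context_window: int = CONTEXT_WINDOW) -> bool:
    """True if the prompt plus the requested output fit in the window."""
    return prompt_tokens + output_tokens <= context_window


if __name__ == "__main__":
    print(max_new_tokens(3_000))   # room left after a 3,000-token prompt
    print(fits(3_500, 1_000))      # whether this request fits the window
```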