Llama 2 70B Chat
DeprecatedOpen Source
About
Llama 2 70B Chat is a large-scale language model with 70 billion parameters, designed for conversational AI applications. Released on July 18, 2023, it's part of Meta's Llama 2 family, featuring advanced transformer architecture optimized through supervised fine-tuning and reinforcement learning with human feedback. The model excels in generating human-like responses, outperforming many open-source alternatives and rivaling closed-source models like ChatGPT. Trained on 2 trillion tokens from diverse public sources, it's suitable for commercial and research applications in English, particularly for assistant-like functionalities. The model is available on Hugging Face for further exploration and implementation .
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution
Providers(14)
Compare all →| Provider | Input (per 1M) | Output (per 1M) | Type | |
|---|---|---|---|---|
| Databricks Foundation Model Serving | $0.5 | $1.5 | Serverless | |
| Microsoft Foundry | $1.54 | $1.77 | ServerlessProvisioned | |
| GCP Vertex AI | — | — | Serverless | |
| Alibaba Cloud PAI-EAS | — | — | Serverless | |
| AWS Bedrock | $1.95 | $2.56 | Serverless | |
| OCI Generative AI | — | — | Serverless | |
| NVIDIA NIM | — | — | Provisioned | |
| DeepInfra | $0.64 | $0.64 | Serverless | |
| Lepton AI API | $0.50 | $0.50 | Serverless | |
| Together AI | $0.9 | $0.9 | Serverless | |
| IBM watsonx | $1.8 | $1.8 | Serverless | |
| Scale AI GenAI Platform | — | — | Serverless | |
| Fireworks AI | $0.9 | $0.9 | Serverless | |
| Replicate API | $0.65 | $2.75 | Serverless |
Benchmark Scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multitask Language Understanding | 68.9 | 5-shot | Open LLM Leaderboard |