Llama 3.1 8B Instruct
About
The Llama 3.1 8B Instruct model, released on July 23, 2024, is a multilingual large language model with 8 billion parameters, optimized for instruction-following tasks. It uses an optimized transformer architecture and supports languages including English, German, and French. The model is tuned for dialogue through supervised fine-tuning and reinforcement learning from human feedback (RLHF). Trained on approximately 15 trillion tokens with a December 2023 data cutoff, it outperforms many open-source and closed chat models on common benchmarks. It suits commercial and research applications such as conversational agents and content generation, and it is available on Hugging Face.
Capabilities
Multimodal, Function Calling, Tool Use, JSON Mode
Providers (8)
| Provider | Input (USD per 1M tokens) | Output (USD per 1M tokens) | Type |
|---|---|---|---|
| OctoAI API | $0.15 | $0.15 | Serverless |
| Together AI API | $0.18 | $0.18 | Serverless |
| Fireworks AI Platform | $0.20 | $0.20 | Serverless |
| NVIDIA NIM | n/a | n/a | Provisioned |
| GroqCloud | $0.05 | $0.08 | Serverless |
| Azure OpenAI | $0.30 | $0.61 | Provisioned |
| Databricks Foundation Model Serving | n/a | n/a | Provisioned |
| Hyperbolic AI Inference | n/a | n/a | Serverless |
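Because the prices are quoted per 1M tokens, the cost of a workload is the token count times the price, divided by one million, summed over input and output. The sketch below works through that arithmetic using the priced rows of the table above. The `PRICES` dictionary and the `estimate_cost` function are illustrative names, not part of any provider's API, and the figures are only as current as this listing.

```python
# Illustrative only: (input, output) prices in USD per 1M tokens,
# copied from the rows of the provider table that list a price.
PRICES = {
    "OctoAI API": (0.15, 0.15),
    "Together AI API": (0.18, 0.18),
    "Fireworks AI Platform": (0.20, 0.20),
    "GroqCloud": (0.05, 0.08),
    "Azure OpenAI": (0.30, 0.61),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    in_price, out_price = PRICES[provider]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 0.5M output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.3f}")
```

Providers without a listed price are left out of the dictionary, so looking them up raises `KeyError` rather than silently returning zero.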
Specifications
Family: Llama 3.1
Released: 2024-07-23
Parameters: 8B
Context: 128K
Architecture: Decoder Only
Specialization: General