Llama 3.1 8B Instruct
Open Source
About
The Llama 3.1 8B Instruct model, released on July 23, 2024, is a multilingual large language model with 8 billion parameters, optimized for instruction-following tasks. It uses an optimized transformer architecture and supports languages including English, German, and French, among others. The model is tuned for dialogue applications through supervised fine-tuning and reinforcement learning from human feedback (RLHF). Trained on approximately 15 trillion tokens with a December 2023 data cutoff, it outperforms many existing open-source and closed chat models on various benchmarks. It is suited to commercial and research applications such as conversational agents and content generation, and can be accessed on Hugging Face.
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution
Providers (12)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| OctoAI API | $0.15 | $0.15 | Serverless |
| Together AI | $0.18 | $0.18 | Serverless |
| Fireworks AI | $0.20 | $0.20 | Serverless |
| NVIDIA NIM | — | — | Provisioned |
| GroqCloud | $0.05 | $0.08 | Serverless |
| Microsoft Foundry | $0.30 | $0.61 | Provisioned |
| Databricks Foundation Model Serving | — | — | Provisioned |
| Hyperbolic AI Inference | $0.10 | $0.10 | Serverless |
| OpenRouter | $0.02 | $0.05 | Serverless |
| IBM watsonx | $0.15 | $0.50 | Serverless |
| AWS Bedrock | — | — | Serverless |
| Replicate API | $0.25 | $0.25 | Serverless |
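
To compare providers for a concrete workload, the per-1M prices in the table can be turned into a rough cost estimate. The sketch below is illustrative only: it assumes the listed prices are in USD per 1 million tokens, that billing is strictly linear with no minimums or volume discounts, and the `PRICES` dictionary, the `estimate_cost` helper, and the sample workload are hypothetical names and numbers introduced here, not part of any provider's API.

```python
# Prices in USD per 1M tokens (input, output), copied from the table above.
# Assumption: billing is linear in token count with no minimum charge.
PRICES = {
    "GroqCloud": (0.05, 0.08),
    "OpenRouter": (0.02, 0.05),
    "Together AI": (0.18, 0.18),
    "Microsoft Foundry": (0.30, 0.61),
}


def estimate_cost(provider, input_tokens, output_tokens):
    """Return the estimated USD cost of a workload on one provider."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    workload = (2_000_000, 500_000)
    # Print providers from cheapest to most expensive for this workload.
    for name in sorted(PRICES, key=lambda p: estimate_cost(p, *workload)):
        print(f"{name}: ${estimate_cost(name, *workload):.4f}")
```

Because input and output tokens are priced separately by some providers, the cheapest option can change with the input-to-output ratio of the workload, so it is worth running the estimate with token counts that reflect actual usage.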