LLM Reference

Llama 3.1 8B Instruct

About

The Llama 3.1 8B Instruct model, released on July 23, 2024, is a multilingual large language model with 8 billion parameters, optimized for instruction-following tasks. It features an enhanced transformer architecture, supporting languages like English, German, French, and others. The model excels in dialogue applications, having been fine-tuned using supervised fine-tuning and reinforcement learning with human feedback. Trained on approximately 15 trillion tokens with a December 2023 data cutoff, it outperforms many existing open-source and closed chat models in various benchmarks. Ideal for commercial and research applications such as conversational agents and content generation, the model can be accessed on Hugging Face .

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(8)

ProviderInput (per 1M)Output (per 1M)Type
OctoAI API$0.15$0.15
Serverless
Together AI API$0.18$0.18
Serverless
Fireworks AI Platform$0.2$0.2
Serverless
NVIDIA NIM
Provisioned
GroqCloud$0.05$0.08
Serverless
Azure OpenAI$0.3$0.61
Provisioned
Databricks Foundation Model Serving
Provisioned
Hyperbolic AI Inference
Serverless

Specifications

FamilyLlama 3.1
Released2024-07-23
Parameters8B
Context128K
ArchitectureDecoder Only
Specializationgeneral