LLM Reference

Llama 3.1 70B Instruct

About

Llama 3.1 70B Instruct is a 70-billion-parameter large language model tuned for instruction following. It is multilingual, supporting English, German, French, and other languages. The model was fine-tuned with supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to improve how it interprets and responds to user instructions. It accepts a context of up to 128K tokens, which suits long multi-turn dialogue and applications that need detailed responses. It is reported to outperform many open-source and proprietary models on common industry benchmarks, and typical uses include conversational AI, content generation, and synthetic data generation. For more details, visit the Hugging Face page [1].

Capabilities

Multimodal, Function Calling, Tool Use, JSON Mode

Providers (8)

Provider                             Input (per 1M)   Output (per 1M)   Type
OctoAI API                           $0.90            $0.90             Serverless
Together AI API                      $0.88            $0.88             Serverless
Fireworks AI Platform                $0.90            $0.90             Serverless
NVIDIA NIM                           -                -                 Provisioned
GroqCloud                            $0.59            $0.79             Serverless
Azure OpenAI                         $2.68            $3.54             Provisioned
Databricks Foundation Model Serving  -                -                 Provisioned
Hyperbolic AI Inference              -                -                 Serverless
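
Because prices are quoted per 1M tokens, the cost of a single request is the token count divided by one million, multiplied by the listed rate, summed over input and output. The following is a minimal sketch of that arithmetic using three of the serverless prices listed above. The function name `estimate_cost` and the `PRICES_PER_MILLION` dictionary are illustrative names, not part of any provider's API, and the prices are a snapshot from this page that may change.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the provider listing above.
# These values are a snapshot; check each provider for current pricing.
PRICES_PER_MILLION = {
    "OctoAI API": (Decimal("0.90"), Decimal("0.90")),
    "Together AI API": (Decimal("0.88"), Decimal("0.88")),
    "GroqCloud": (Decimal("0.59"), Decimal("0.79")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[provider]
    # Scale each per-1M rate by the number of tokens actually used.
    return (input_price * input_tokens + output_price * output_tokens) / ONE_MILLION


if __name__ == "__main__":
    # A long prompt with a short completion on GroqCloud.
    print(estimate_cost("GroqCloud", 100_000, 2_000))
```

With these rates, a request of 100,000 input tokens and 2,000 output tokens on GroqCloud comes to roughly $0.06. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.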

Specifications

Family: Llama 3.1
Released: 2024-07-23
Parameters: 70B
Context: 128K
Architecture: Decoder-only
Specialization: General