LLM Reference

Llama 3 70B Instruct

About

The Llama 3 70B Instruct model is a large language model with 70 billion parameters, released by Meta on April 18, 2024. It's an instruction-tuned variant optimized for conversational applications, utilizing an advanced auto-regressive transformer architecture. The model excels in following instructions and engaging in dialogue, having been trained on over 15 trillion tokens with a December 2023 knowledge cutoff. It demonstrates superior performance on industry benchmarks, scoring 82.0 on the MMLU (5-shot) test. The model incorporates extensive safety measures and optimizations, including RLHF, to enhance helpfulness and reduce harmful content generation. For more details, visit the model's Hugging Face page [1].

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(19)

ProviderInput (per 1M)Output (per 1M)Type
GCP Vertex AIServerless
AWS Bedrock$2.65$3.5Serverless
Azure OpenAI$3.78$11.34ServerlessProvisioned
NVIDIA NIMProvisioned
GroqCloud$0.59$0.79Serverless
deepinfra APIServerless
OctoAI API$0.9$0.9Serverless
Replicate APIServerless
Databricks Foundation Model Serving$1$3Serverless
Fireworks AI Platform$0.9$0.9Serverless
Baseten APIServerless
Lepton AI APIServerless
Snowflake Cortex$2.42$2.42Serverless
OCI Generative AIServerless
Together AI API$0.88$0.88Serverless
Perplexity LabsServerless
IBM watsonx$1.8$1.8Serverless
Scale AI GenAI PlatformServerless
Hyperbolic AI InferenceServerless

Benchmark Scores(2)

BenchmarkScoreVersionSource
HumanEval72.6pass@1Open LLM Leaderboard
Massive Multitask Language Understanding82.05-shotOpen LLM Leaderboard

Specifications

FamilyLlama 3
Released2024-04-18
Parameters70B
Context8K
ArchitectureDecoder Only
Specializationgeneral