LLM ReferenceLLM Reference

Llama 3 70B Instruct

llama3-70b-instruct

Open Source

About

The Llama 3 70B Instruct model is a large language model with 70 billion parameters, released by Meta on April 18, 2024. It's an instruction-tuned variant optimized for conversational applications, utilizing an advanced auto-regressive transformer architecture. The model excels in following instructions and engaging in dialogue, having been trained on over 15 trillion tokens with a December 2023 knowledge cutoff. It demonstrates superior performance on industry benchmarks, scoring 82.0 on the MMLU (5-shot) test. The model incorporates extensive safety measures and optimizations, including RLHF, to enhance helpfulness and reduce harmful content generation. For more details, visit the model's Hugging Face page [1].

Llama 3 70B Instruct has a 8K-token context window.

Llama 3 70B Instruct input tokens at $0.4/1M, output at $0.4/1M.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(18)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
GCP Vertex AI$1.20$3.60Serverless
AWS Bedrock$2.65$3.5Serverless
Microsoft Foundry$3.78$11.34ServerlessProvisioned
NVIDIA NIMProvisioned
DeepInfra$0.45$0.65Serverless
OctoAI API$0.9$0.9Serverless
Databricks Foundation Model Serving$1$3Serverless
Fireworks AI$0.9$0.9Serverless
Baseten APIServerless
Lepton AI API$0.80$0.80Serverless
OCI Generative AIServerless
Together AI$0.88$0.88Serverless
Perplexity Labs$1.00$1.00Serverless
IBM watsonx$1.8$1.8Serverless
Scale AI GenAI PlatformServerless
Hyperbolic AI Inference$0.40$0.40Serverless
OpenRouter$0.51$0.74Serverless
Replicate API$0.65$2.75Serverless

Benchmark Scores(4)

BenchmarkScoreVersionSource
HumanEval72.6pass@1Open LLM Leaderboard
Massive Multitask Language Understanding82.05-shotOpen LLM Leaderboard
Instruction-Following Evaluation77.8v2https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
MMLU PRO57.4https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

FamilyLlama 3
Released2024-04-18
Parameters70B
Context8K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuned

Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
Website