LLM Reference

Llama 3 70B Instruct

Released: 2024-04-18
Last refreshed: 2026-05-22
Status: Researched 55d ago
Open Weights · Commercial use with conditions · Coding · Classification · JSON / Tool use

Llama 3 70B Instruct is worth evaluating for coding, classification, and JSON / tool use when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, classification, and JSON / tool use
  • Workloads that can use an 8k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family: Llama 3
Released: 2024-04-18
Context: 8k
Parameters: 70B
Architecture: Decoder only
Knowledge cutoff: 2023-12
Specialization: General
Openness: Open weights
License: Llama 3 Community · Commercial use with conditions
Training: Fine-tuned
Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
Pricing
Input / 1M: $0.400
Output / 1M: $0.400

Cheapest of 18 routes · Hyperbolic AI Inference

About

Llama 3 70B Instruct is a 70-billion-parameter large language model released by Meta on April 18, 2024. It is an instruction-tuned variant optimized for conversational applications and uses an auto-regressive transformer architecture. The model was trained on over 15 trillion tokens with a December 2023 knowledge cutoff, and it scores 82.0 on MMLU (5-shot). Its safety work and tuning, including RLHF, are intended to improve helpfulness and reduce harmful content generation. For more details, see the model's Hugging Face page [1].

Llama 3 70B Instruct is an open-weight model in the Llama 3 family. Its structured metadata lists an 8k-token context window and support for structured outputs. This page tracks provider routes through GCP Vertex AI, AWS Bedrock, Microsoft Foundry, and 15 more, with the cheapest tracked route listed at $0.40 input and $0.40 output per 1M tokens. Headline tracked benchmarks include HumanEval 72.6, Massive Multitask Language Understanding 82.0, and Instruction-Following Evaluation 77.8.
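Because the context window is small by current standards, it can help to check whether a prompt plus the expected reply fits before choosing this model. The sketch below is illustrative only: it assumes "8k" means 8,192 tokens and approximates token counts at roughly four characters per token, which is a heuristic rather than the model's real tokenizer. The names `estimate_tokens` and `fits_context` are hypothetical.

```python
import math

# Assumption: the "8k" context listed on this page means 8,192 tokens.
CONTEXT_WINDOW = 8192


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate from character count (heuristic, not a tokenizer)."""
    return math.ceil(len(text) / chars_per_token)


def fits_context(prompt: str, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the estimated prompt tokens plus the reply budget fit."""
    return estimate_tokens(prompt) + max_output_tokens <= window


if __name__ == "__main__":
    short_prompt = "Classify this support ticket: the login page times out."
    print(fits_context(short_prompt, max_output_tokens=256))
```

For production use, counting tokens with the model's own tokenizer would replace the character heuristic; the budget comparison itself stays the same.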

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ B

1 relevant benchmark in the decision map.

Classification

Q/$ C

2 relevant benchmarks in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider | Input / 1M | Output / 1M | Route
Hyperbolic AI Inference | $0.400 | $0.400 | Serverless
DeepInfra | $0.450 | $0.650 | Serverless
Novita AI | $0.510 | $0.740 | Serverless
OpenRouter | $0.510 | $0.740 | Serverless
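
The per-1M prices listed above turn into a workload cost by multiplying input tokens by the input price and output tokens by the output price, then dividing by one million. The following is a minimal sketch of that arithmetic for the four listed routes. The `Route` class, the helper names, and the example token volumes are assumptions for illustration; the prices are copied from this page as tracked and may be out of date.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Prices as listed in the provider price ladder on this page.
ROUTES = [
    Route("Hyperbolic AI Inference", 0.400, 0.400),
    Route("DeepInfra", 0.450, 0.650),
    Route("Novita AI", 0.510, 0.740),
    Route("OpenRouter", 0.510, 0.740),
]


def workload_cost(route: Route, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a given token volume on one route."""
    return (input_tokens * route.input_per_million
            + output_tokens * route.output_per_million) / 1_000_000


def rank_routes(input_tokens: int, output_tokens: int) -> list[Route]:
    """Routes sorted from cheapest to most expensive for this workload."""
    return sorted(ROUTES, key=lambda r: workload_cost(r, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical monthly volume: 10M input tokens, 2M output tokens.
    for route in rank_routes(10_000_000, 2_000_000):
        cost = workload_cost(route, 10_000_000, 2_000_000)
        print(f"{route.provider}: ${cost:.2f}")
```

For that hypothetical volume of 10M input and 2M output tokens, the arithmetic gives about $4.80 on Hyperbolic AI Inference and $5.80 on DeepInfra. Output-heavy workloads widen the gap, since the other listed routes charge more for output than for input.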

Available via routers & gateways (16)

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + fee · GCP Vertex AI

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

Passthrough · AWS Bedrock

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

Passthrough · Microsoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

Subscription · Microsoft Foundry · GCP Vertex AI

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

Subscription · GCP Vertex AI · Microsoft Foundry

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSS · GCP Vertex AI · Microsoft Foundry

Capabilities

Structured Outputs
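
Even with structured output support, it is usually worth validating responses on the client side, since a reply can be well-formed JSON and still miss the fields a pipeline expects. The sketch below assumes a hypothetical classification response with `label` and `confidence` fields. The field names, the label set, and the `parse_classification` helper are illustrative and are not defined by this page or by any provider API.

```python
import json

# Hypothetical label set for an example ticket classifier.
ALLOWED_LABELS = {"bug", "feature", "question"}


def parse_classification(raw: str) -> dict:
    """Parse and validate a model reply expected to be a JSON object
    with a known `label` and a `confidence` between 0 and 1."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc

    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    label = data.get("label")
    if not isinstance(label, str) or label not in ALLOWED_LABELS:
        raise ValueError(f"unexpected label: {label!r}")

    confidence = data.get("confidence")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(confidence, bool) or not isinstance(confidence, (int, float)):
        raise ValueError("confidence must be a number")
    if not 0 <= confidence <= 1:
        raise ValueError("confidence must be between 0 and 1")

    return {"label": label, "confidence": float(confidence)}


if __name__ == "__main__":
    print(parse_classification('{"label": "bug", "confidence": 0.9}'))
```

Raising `ValueError` on any mismatch keeps the calling code simple: it can retry the request or fall back to another route without inspecting partial results.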

Benchmark peer bars for Coding

Benchmark scores (4)

Scores are benchmark-specific and direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether a given margin is meaningful for your workload.
Benchmark | Score | Version | Source
HumanEval | 72.6 | pass@1 | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding | 82.0 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Instruction-Following Evaluation | 77.8 | v2 | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
MMLU PRO | 57.4 | - | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Migration checks

No linked migration route is available for this model yet.
