Llama 3 70B
Llama 3 70B is worth evaluating for coding and classification when its provider route and context window match the workload.
Use it for
- Teams evaluating coding and classification
- Workloads that can use a 8k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Llama 3
- Released
- 2024-04-18
- Context
- 8k
- Parameters
- 70B
- Architecture
- Decoder Only
- Knowledge cutoff
- 2023-12
- Specialization
- general
- Openness
- Open weights
- License
- Llama 3 CommunityCommercial use with conditions
- Training
- finetuned
Large-scale open-source AI for social technologies.
Cheapest of 1 route · Replicate API
About
The Llama 3 70B model is a state-of-the-art large language model with 70 billion parameters, released by Meta on April 18, 2024. It's based on an auto-regressive transformer architecture and has been optimized for dialogue applications using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). The model supports an 8,000-token context length and has been trained on over 15 trillion tokens from public online sources. It excels in tasks such as conversational AI, text generation, and natural language understanding, outperforming many existing open-source chat models on industry benchmarks. The model is designed with a focus on safety and helpfulness, making it suitable for both commercial and research applications, particularly in English. For more details, visit the Hugging Face link .
Llama 3 70B is an open-weight model in the Llama 3 family. The structured metadata tracks a 8k-token context window. This page tracks provider routes through Replicate API, with the cheapest tracked route listed at $0.65 input and $2.75 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 44.1, HellaSwag 92.4, and HumanEval 72.6.
Top use-case fit: coding, agents, and build tasks
Coding
Q/$ D1 relevant benchmark in the decision map.
Classification
Q/$ D2 relevant benchmarks in the decision map.
Provider price ladder
Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Replicate API | $0.650 | $2.75 | Serverless |
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
Benchmark scores(7)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 44.1 | diamond | research |
| HellaSwag | 92.4 | 10-shot | research |
| HumanEval | 72.6 | pass@1 | research |
| Massive Multitask Language Understanding | 80.5 | 5-shot | research |
| Grade School Math 8K | 93.0 | — | https://arxiv.org/abs/2407.21783 |
| BIG-Bench Hard | 83.2 | — | https://arxiv.org/abs/2407.21783 |
| AI2 Reasoning Challenge | 94.8 | — | https://arxiv.org/abs/2407.21783 |
Migration checks
No linked migration route is available for this model yet.
Rankings & picks(1)
Comparison and alternatives
Browse all comparisons →Large-scale open-source AI for social technologies.
Cheapest of 1 route · Replicate API