Llama 3.3 70B Instruct
Llama 3.3 70B Instruct is worth evaluating for RAG, long-context work, and classification when its provider route and context window fit the workload.
Use it for
- Teams evaluating RAG, long context, and classification
- Workloads that can use a 128k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Family: Llama 3.3
- Released: 2025-09-01
- Context: 128k
- Parameters: 70B
- Knowledge cutoff: 2023-12
- Openness: Open weights
- License: Llama 3 Community
- Commercial use: conditional
Large-scale open-source AI for social technologies.
Cheapest of 1 route · AWS Bedrock
About
Llama 3.3 70B Instruct is Meta's open-weight, instruction-tuned model in the Llama 3.3 family. It offers a 128k-token context window, and its weights are openly available for self-hosting.
The structured metadata also lists support for structured outputs. This page tracks one provider route, AWS Bedrock, listed at $0.96 input and $1.28 output per 1M tokens. The headline tracked benchmark is BFCL, with a score of 31.9.
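A quick way to check whether a workload fits the 128k-token context window is to budget prompt and output tokens together. The sketch below is illustrative only. It assumes "128k" means 128,000 tokens, that the window covers both the prompt and the generated output, and that token counts come from a tokenizer you run yourself. The names `remaining_budget` and `fits_context` are hypothetical helpers, not part of any provider SDK.

```python
# Assumption: "128k" is read as 128,000 tokens; some providers may count it differently.
CONTEXT_WINDOW_TOKENS = 128_000


def remaining_budget(prompt_tokens: int, reserved_output_tokens: int,
                     window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Tokens left in the window after the prompt and the reserved output."""
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return window - prompt_tokens - reserved_output_tokens


def fits_context(prompt_tokens: int, reserved_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """True when the prompt plus the reserved output fit inside the window."""
    return remaining_budget(prompt_tokens, reserved_output_tokens, window) >= 0


if __name__ == "__main__":
    # A 120,000-token prompt with 4,000 tokens reserved for the answer.
    print(fits_context(120_000, 4_000))
    print(remaining_budget(120_000, 4_000))
```

Reserving output tokens up front keeps a long retrieved context from crowding out the answer.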
Top use-case fit
RAG
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Classification
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing across 1 provider for input and output tokens, plus batch and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| AWS Bedrock | $0.96 | $1.28 | Serverless |
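The per-1M-token prices in the table above translate into a per-request cost with simple arithmetic. The following is a minimal sketch that assumes the listed prices apply uniformly, with no batch or cached-read discounts. The function name `estimate_cost` is hypothetical, and `Decimal` is used only to avoid floating-point rounding in small amounts.

```python
from decimal import Decimal

# Prices from the table above: USD per 1M tokens on the AWS Bedrock serverless route.
INPUT_PRICE_PER_M = Decimal("0.96")
OUTPUT_PRICE_PER_M = Decimal("1.28")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD cost of one request at the listed prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 100,000 input tokens and 2,000 output tokens cost 0.096 + 0.00256 USD.
    print(estimate_cost(100_000, 2_000))
```

For long-context workloads the input side tends to dominate the bill, so the input price is usually the figure to compare first.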
Available via routers & gateways (1)
Capabilities
Benchmark peer bars for JSON / Tool use
Benchmark scores (1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| BFCL | 31.9 | — | https://gorilla.cs.berkeley.edu/leaderboard.html |
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of Llama 3.3 70B Instruct?
Llama 3.3 70B Instruct has a context window of 128k tokens.
How much does Llama 3.3 70B Instruct cost?
Through AWS Bedrock, Llama 3.3 70B Instruct is listed at $0.96 per 1M input tokens and $1.28 per 1M output tokens.
When was Llama 3.3 70B Instruct released?
Llama 3.3 70B Instruct was released on 2025-09-01.
Which providers offer Llama 3.3 70B Instruct?
Llama 3.3 70B Instruct is available from 1 provider: AWS Bedrock.
What benchmarks has Llama 3.3 70B Instruct been tested on?
Llama 3.3 70B Instruct has been evaluated on 1 benchmark, including BFCL.