Llama 3.3 70B Instruct

Name: Llama 3.3 70B Instruct
Author: AI at Meta

Released

2025-09-01

Last refreshed

2026-06-15

Status

Researched 28d ago

Open weights · Commercial use: conditional · RAG · Long context · Classification · JSON / Tool use

Llama 3.3 70B Instruct is worth evaluating for RAG, long-context, and classification workloads when its provider route and context window match the workload.

Use it for

Teams evaluating RAG, long-context, and classification workloads
Workloads that can use a 128k context window
Buyers evaluating the 1 tracked provider route (AWS Bedrock)

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Llama 3.3
Released: 2025-09-01
Context: 128k
Parameters: 70B
Knowledge cutoff: 2023-12
Openness: Open weights
License: Llama 3 Community
Commercial use: conditional

Created by

AI at Meta

Large-scale open-source AI for social technologies.

Menlo Park, California, United States

Founded 2013

Pricing

Output / 1M

$1.28

Input / 1M

$0.960

Cheapest of 1 route · AWS Bedrock

Providers (1)

AWS Bedrock

About

Llama 3.3 70B Instruct is a 70B-parameter instruction-tuned model in Meta's Llama 3.3 family. It offers a 128k-token context window, and its weights are openly available for self-hosting.

Llama 3.3 70B Instruct is an open-weight model in the Llama 3.3 family. The structured metadata tracks a 128k-token context window and structured outputs. This page tracks provider routes through AWS Bedrock, with the cheapest tracked route listed at $0.96 input and $1.28 output per 1M tokens. Headline tracked benchmarks include BFCL 31.9.
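To make the 128k-token figure concrete, here is a minimal sketch of a context-budget check. It assumes the window is exactly 128,000 tokens and that prompt and generated tokens share the same window; the exact limit and accounting can differ by provider, and the function name `fits_context` is hypothetical. Token counts must come from the tokenizer you actually use.

```python
# Assumption: "128k" is treated as 128,000 tokens; a provider may expose a
# slightly different limit (for example 131,072).
CONTEXT_WINDOW_TOKENS = 128_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output budget fits the window.

    Assumes prompt and completion tokens count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


if __name__ == "__main__":
    # A 100k-token RAG prompt with a 4k-token answer budget fits.
    print(fits_context(100_000, 4_000))
    # A 126k-token prompt with the same answer budget does not.
    print(fits_context(126_000, 4_000))
```

Reserving the output budget up front avoids requests that are accepted but truncated mid-answer.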

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Included by capability and metadata signals in the decision map.

Provider price ladder

API pricing for the 1 tracked provider, covering input and output tokens, plus batch and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
AWS Bedrock	$0.960	$1.28	Serverless
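The per-1M rates above translate into a per-request cost with simple arithmetic. The sketch below uses the tracked AWS Bedrock serverless rates from this page ($0.96 input, $1.28 output per 1M tokens); the function name `estimate_cost` is hypothetical, and batch or cached-read discounts are not modeled.

```python
from decimal import Decimal

# Rates taken from the price ladder on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.96")
OUTPUT_PRICE_PER_M = Decimal("1.28")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the tracked serverless rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 100,000 input tokens and 2,000 output tokens:
    # 0.096 + 0.00256 = 0.09856 USD
    print(estimate_cost(100_000, 2_000))
```

`Decimal` is used instead of floats so that small per-request amounts sum without rounding drift when aggregated over many calls.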

Available via routers & gateways (1)

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

Passthrough · AWS Bedrock

Capabilities

Structured Outputs

Benchmark peer bars for JSON / Tool use

BFCL · Rank 13 of 17

Peer scores shown: 77.5, 73.2, 72.9, 72.5
Llama 3.3 70B Instruct (current): 31.9

Benchmark scores (1)

Scores are benchmark-specific and direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether a margin is meaningful for your workload.