Llama 3.3 70B Instruct (free)

Name: Llama 3.3 70B Instruct (free)
Author: AI at Meta

Released

2024-12-06

Last refreshed

2026-06-30

Status

Researched 31d ago

Open weightsCommercial use: conditionalClassificationJSON / Tool use

Llama 3.3 70B Instruct (free) is worth evaluating for classification and json / tool use when its provider route and context window match the workload.

Use it for

Teams evaluating classification and json / tool use
Workloads that can use a 66k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Llama 3.3
Released: 2024-12-06
Context: 66k
Parameters: 70B
Architecture: Decoder Only
Knowledge cutoff: 2023-12
Specialization: general
Openness: Open weights
License: Llama 3 CommunityCommercial use: conditional

Created by

AI at Meta

Large-scale open-source AI for social technologies.

Menlo Park, California, United States

Founded 2013

Website

Pricing

Output / 1M

$0.320

Input / 1M

$0.100

Cheapest of 11 routes · OpenRouter

Providers(11)

Cloudflare Workers AI NVIDIA NIM GroqCloud Together AI Arcee AI Novita AI Chutes AI OpenRouter AWS Bedrock Microsoft Foundry Vercel AI Gateway

View 11 provider routes

Links

Website

About

Meta: Llama 3.3 70B Instruct (free) available via OpenRouter. Pricing: $null/1M input, $null/1M output.

Llama 3.3 70B Instruct (free) is the OpenRouter free-tier entry for Meta's instruction-tuned 70B Llama 3.3 model. The underlying model was released on December 6, 2024 and is designed to deliver much of the Llama 3.1 405B instruction-following quality at a lower serving cost. This seed row, however, is not the canonical full-context model profile: it represents a hosted free listing with a 66K-token context window and no listed token price in the page data.

The free listing is best understood as an evaluation and prototyping path. It can cover chat, coding assistance, multilingual prompts, structured output experiments, and lightweight reasoning tests, but provider-side rate limits and context caps matter more than they do on paid endpoints. Teams that need the full 128K-class Llama 3.3 deployment, a license/self-hosting decision, or production throughput should compare the paid provider rows or the canonical non-free Llama 3.3 entry instead of treating this free SKU as the whole model story.

The price ladder for this slug includes hosted options such as OpenRouter, Groq, Together AI, NVIDIA NIM, Novita AI, AWS Bedrock, Microsoft Foundry, Vercel AI Gateway, and other providers. Use the free OpenRouter URL for quick trials; use the paid rows when context, reliability, and support terms are part of the decision.

Llama 3.3 70B Instruct (free) has a 66k-token context window.

Llama 3.3 70B Instruct (free) input tokens at $0.1/1M, output at $0.32/1M.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 11

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
OpenRouter	$0.100	$0.320	Serverless
Novita AI	$0.135	$0.400	Serverless
Chutes AI	$0.220	$0.660	Serverless
Microsoft Foundry	$0.710	$0.710	ServerlessProvisioned

Available via routers & gateways(8)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSMicrosoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughTogether AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionMicrosoft Foundry

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

PassthroughAWS Bedrock

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

PassthroughMicrosoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionMicrosoft Foundry

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Compare Llama 3.3 70B Instruct (free) with other models

Comparison and alternatives

Browse all comparisons →

Llama 3.3 70B Instruct (free) vs Qwen2.5-72B-Instruct Llama 3.3 70B Instruct (free) vs ShieldGemma 9B

Frequently asked questions

What is the context window of Llama 3.3 70B Instruct (free)?

Llama 3.3 70B Instruct (free) has a context window of 66k tokens.

How much does Llama 3.3 70B Instruct (free) cost?

Llama 3.3 70B Instruct (free) pricing ranges from $0.1/1M to $1.04/1M input tokens depending on the provider.

When was Llama 3.3 70B Instruct (free) released?

Llama 3.3 70B Instruct (free) was released on 2024-12-06.

Which providers offer Llama 3.3 70B Instruct (free)?

Llama 3.3 70B Instruct (free) is available from 11 providers: Cloudflare Workers AI, NVIDIA NIM, GroqCloud, Together AI, Arcee AI, Novita AI, Chutes AI, OpenRouter, AWS Bedrock, Microsoft Foundry, Vercel AI Gateway.

Created by

AI at Meta

Large-scale open-source AI for social technologies.

Menlo Park, California, United States

Founded 2013

Website

Pricing

Output / 1M

$0.320

Input / 1M

$0.100

Cheapest of 11 routes · OpenRouter

Providers(11)

Cloudflare Workers AI NVIDIA NIM GroqCloud Together AI Arcee AI Novita AI Chutes AI OpenRouter AWS Bedrock Microsoft Foundry Vercel AI Gateway

View 11 provider routes

Links

Website