Llama 3.3 70B Instruct (free)
Llama 3.3 70B Instruct (free) is worth evaluating for classification and json / tool use when its provider route and context window match the workload.
Use it for
- Teams evaluating classification and json / tool use
- Workloads that can use a 66k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- Llama 3.3
- Released
- 2024-12-06
- Context
- 66k
- Parameters
- 70B
- Architecture
- Decoder Only
- Knowledge cutoff
- 2023-12
- Specialization
- general
- Openness
- Open weights
- License
- Llama 3 CommunityCommercial use: conditional
Large-scale open-source AI for social technologies.
Cheapest of 11 routes · OpenRouter
About
Meta: Llama 3.3 70B Instruct (free) available via OpenRouter. Pricing: $null/1M input, $null/1M output.
Llama 3.3 70B Instruct (free) is the OpenRouter free-tier entry for Meta's instruction-tuned 70B Llama 3.3 model. The underlying model was released on December 6, 2024 and is designed to deliver much of the Llama 3.1 405B instruction-following quality at a lower serving cost. This seed row, however, is not the canonical full-context model profile: it represents a hosted free listing with a 66K-token context window and no listed token price in the page data.
The free listing is best understood as an evaluation and prototyping path. It can cover chat, coding assistance, multilingual prompts, structured output experiments, and lightweight reasoning tests, but provider-side rate limits and context caps matter more than they do on paid endpoints. Teams that need the full 128K-class Llama 3.3 deployment, a license/self-hosting decision, or production throughput should compare the paid provider rows or the canonical non-free Llama 3.3 entry instead of treating this free SKU as the whole model story.
The price ladder for this slug includes hosted options such as OpenRouter, Groq, Together AI, NVIDIA NIM, Novita AI, AWS Bedrock, Microsoft Foundry, Vercel AI Gateway, and other providers. Use the free OpenRouter URL for quick trials; use the paid rows when context, reliability, and support terms are part of the decision.
Llama 3.3 70B Instruct (free) has a 66k-token context window.
Llama 3.3 70B Instruct (free) input tokens at $0.1/1M, output at $0.32/1M.
Top use-case fit
Classification
Included by capability and metadata signals in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 11Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| OpenRouter | $0.100 | $0.320 | Serverless |
| Novita AI | $0.135 | $0.400 | Serverless |
| Chutes AI | $0.220 | $0.660 | Serverless |
| Microsoft Foundry | $0.710 | $0.710 | ServerlessProvisioned |
Available via routers & gateways(8)
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Portkey
GatewayProduction AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.
Amazon Bedrock Intelligent Prompt Routing
RouterAWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.
Azure AI Foundry Model Router
RouterMicrosoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.
Helicone
GatewayObservability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.
Capabilities
Benchmark peer barsfor Classification
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
Compare Llama 3.3 70B Instruct (free) with other models
Frequently asked questions
What is the context window of Llama 3.3 70B Instruct (free)?
Llama 3.3 70B Instruct (free) has a context window of 66k tokens.
How much does Llama 3.3 70B Instruct (free) cost?
Llama 3.3 70B Instruct (free) pricing ranges from $0.1/1M to $1.04/1M input tokens depending on the provider.
When was Llama 3.3 70B Instruct (free) released?
Llama 3.3 70B Instruct (free) was released on 2024-12-06.
Which providers offer Llama 3.3 70B Instruct (free)?
Llama 3.3 70B Instruct (free) is available from 11 providers: Cloudflare Workers AI, NVIDIA NIM, GroqCloud, Together AI, Arcee AI, Novita AI, Chutes AI, OpenRouter, AWS Bedrock, Microsoft Foundry, Vercel AI Gateway.
Large-scale open-source AI for social technologies.
Cheapest of 11 routes · OpenRouter