Last refreshed 2026-05-01. Next refresh: weekly.
Why use Llama 3.3 70B Instruct (free) on NVIDIA NIM?
NVIDIA NIM offers Llama 3.3 70B Instruct (free) with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Compare Llama 3.3 70B Instruct (free) across 9 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: meta/llama-3.3-70b-instructmeta/llama-3.3-70b-instructRequest example
meta/llama-3.3-70b-instruct.Gotchas
- Use provider model ID "meta/llama-3.3-70b-instruct", not the LLMReference slug "llama-3.3-70b-instruct".
Compare Llama 3.3 70B Instruct (free) Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| NVIDIA NIM | — | — |
| GroqCloud | $0.59 | $0.79 |
| Together AI | $0.44 | $0.44 |
| Arcee AI | $0.60 | $1.80 |
| Novita AI | $0.25 | $0.75 |
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 4xH100 |
Capabilities
About Llama 3.3 70B Instruct (free)
Meta: Llama 3.3 70B Instruct (free) available via OpenRouter. Pricing: $null/1M input, $null/1M output.
FAQ
What is the context window for Llama 3.3 70B Instruct (free) on NVIDIA NIM?
Llama 3.3 70B Instruct (free) supports a 66,000 token context window on NVIDIA NIM.
How does NVIDIA NIM compare to other Llama 3.3 70B Instruct (free) providers?
Llama 3.3 70B Instruct (free) is available from 9 providers. The cheapest input pricing is $0.1/1M tokens from OpenRouter.
What API model ID do I use for Llama 3.3 70B Instruct (free) on NVIDIA NIM?
Use the model ID meta/llama-3.3-70b-instruct when calling NVIDIA NIM's API.
Who created Llama 3.3 70B Instruct (free)?
Llama 3.3 70B Instruct (free) was created by AI at Meta as part of the Llama 3.3 model family.
Is Llama 3.3 70B Instruct (free) open source?
Llama 3.3 70B Instruct (free) is open source according to the seed data.