Last refreshed 2026-05-22. Next refresh: weekly.
Why use Llama 3.2 3B Instruct on Novita AI?
Novita AI offers Llama 3.2 3B Instruct with pay-as-you-go pricing at $0.03/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Compare Llama 3.2 3B Instruct across 6 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: llama-3.2-3b-instructllama-3.2-3b-instructRequest example
Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Compare Llama 3.2 3B Instruct Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| OpenRouter | $0.05 | $0.34 |
| Fireworks AI | $0.10 | $0.10 |
| NVIDIA NIM | — | — |
| AWS Bedrock | $0.15 | $0.15 |
| Vercel AI Gateway | $0.15 | $0.15 |
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.03 |
| Output tokens | $0.05 |
Capabilities
About Llama 3.2 3B Instruct
Llama 3.2 3B Instruct is Meta's Llama 3.2 model. It offers a 128K-token context window with weights openly available for self-hosting and scores 34.7 on MMLU PRO.
FAQ
What does Llama 3.2 3B Instruct cost on Novita AI?
On Novita AI, Llama 3.2 3B Instruct costs $0.03 per 1M input tokens and $0.05 per 1M output tokens.
What is the context window for Llama 3.2 3B Instruct on Novita AI?
Llama 3.2 3B Instruct supports a 32,768 token context window on Novita AI.
How does Novita AI compare to other Llama 3.2 3B Instruct providers?
Llama 3.2 3B Instruct is available from 6 providers. The cheapest input pricing is $0.03/1M tokens from Novita AI.
What API model ID do I use for Llama 3.2 3B Instruct on Novita AI?
Use the model ID llama-3.2-3b-instruct when calling Novita AI's API.
Who created Llama 3.2 3B Instruct?
Llama 3.2 3B Instruct was created by AI at Meta as part of the Llama 3.2 model family.
Is Llama 3.2 3B Instruct open source?
Llama 3.2 3B Instruct is open source according to the seed data.