Last refreshed 2026-05-22. Next refresh: weekly.
Why use Qwen3-32B on Novita AI?
Novita AI offers Qwen3-32B with pay-as-you-go pricing at $0.10/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Compare Qwen3-32B across 6 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: qwen3-32b-fp8qwen3-32b-fp8Request example
Gotchas
- Use provider model ID "qwen3-32b-fp8", not the LLMReference slug "qwen3-32b".
Compare Qwen3-32B Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Fireworks AI | $0.90 | $0.90 |
| GroqCloud | $0.29 | $0.59 |
| AWS Bedrock | $0.15 | $0.62 |
| OpenRouter | $0.08 | $0.24 |
| Vercel AI Gateway | $0.16 | $0.64 |
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.10 |
| Output tokens | $0.45 |
Capabilities
About Qwen3-32B
Qwen3-32B is Alibaba's Qwen3 model. It offers a 40K-token context window.
FAQ
What does Qwen3-32B cost on Novita AI?
On Novita AI, Qwen3-32B costs $0.1 per 1M input tokens and $0.45 per 1M output tokens.
What is the context window for Qwen3-32B on Novita AI?
Qwen3-32B supports a 40,960 token context window on Novita AI.
How does Novita AI compare to other Qwen3-32B providers?
Qwen3-32B is available from 6 providers. The cheapest input pricing is $0.08/1M tokens from OpenRouter.
What API model ID do I use for Qwen3-32B on Novita AI?
Use the model ID qwen3-32b-fp8 when calling Novita AI's API.
Who created Qwen3-32B?
Qwen3-32B was created by Alibaba as part of the Qwen3 model family.
Is Qwen3-32B open source?
Qwen3-32B is not open source; the seed data lists it as proprietary.