LLM Reference

Qwen3-32B on Novita AI

Qwen3 · Alibaba

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use Qwen3-32B on Novita AI?

Novita AI offers Qwen3-32B with pay-as-you-go pricing at $0.10/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Compare Qwen3-32B across 6 providers to find the best fit for your use case
Input / 1M
$0.10
Output / 1M
$0.45
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: qwen3-32b-fp8
Model ID
qwen3-32b-fp8

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

  • Use provider model ID "qwen3-32b-fp8", not the LLMReference slug "qwen3-32b".

Compare Qwen3-32B Across Providers

ProviderInput (per 1M)Output (per 1M)
Fireworks AI$0.90$0.90
GroqCloud$0.29$0.59
AWS Bedrock$0.15$0.62
OpenRouter$0.08$0.24
Vercel AI Gateway$0.16$0.64
View all 6 providers →

Pricing

TypePrice (per 1M)
Input tokens$0.10
Output tokens$0.45

Capabilities

Structured Outputs

About Qwen3-32B

Qwen3-32B is Alibaba's Qwen3 model. It offers a 40K-token context window.

FAQ

What does Qwen3-32B cost on Novita AI?

On Novita AI, Qwen3-32B costs $0.1 per 1M input tokens and $0.45 per 1M output tokens.

What is the context window for Qwen3-32B on Novita AI?

Qwen3-32B supports a 40,960 token context window on Novita AI.

How does Novita AI compare to other Qwen3-32B providers?

Qwen3-32B is available from 6 providers. The cheapest input pricing is $0.08/1M tokens from OpenRouter.

What API model ID do I use for Qwen3-32B on Novita AI?

Use the model ID qwen3-32b-fp8 when calling Novita AI's API.

Who created Qwen3-32B?

Qwen3-32B was created by Alibaba as part of the Qwen3 model family.

Is Qwen3-32B open source?

Qwen3-32B is not open source; the seed data lists it as proprietary.

Get Started