LLM Reference

Qwen3-8B on Novita AI

Qwen3 · Alibaba

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use Qwen3-8B on Novita AI?

Novita AI offers Qwen3-8B with pay-as-you-go pricing at $0.04/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Compare Qwen3-8B across 3 providers to find the best fit for your use case
Input / 1M
$0.035
Output / 1M
$0.138
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: qwen3-8b-fp8
Model ID
qwen3-8b-fp8

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

  • Use provider model ID "qwen3-8b-fp8", not the LLMReference slug "qwen3-8b".

Compare Qwen3-8B Across Providers

ProviderInput (per 1M)Output (per 1M)
Fireworks AI$0.20$0.20
OpenRouter$0.05$0.40
Novita AI$0.04$0.14

Pricing

TypePrice (per 1M)
Input tokens$0.04
Output tokens$0.14

Capabilities

Structured Outputs

About Qwen3-8B

Compact base model in Qwen3 series, supporting 100+ languages with strong reasoning and coding capabilities.

FAQ

What does Qwen3-8B cost on Novita AI?

On Novita AI, Qwen3-8B costs $0.035 per 1M input tokens and $0.138 per 1M output tokens.

What is the context window for Qwen3-8B on Novita AI?

Qwen3-8B supports a 131,072 token context window on Novita AI.

How does Novita AI compare to other Qwen3-8B providers?

Qwen3-8B is available from 3 providers. The cheapest input pricing is $0.035/1M tokens from Novita AI.

What API model ID do I use for Qwen3-8B on Novita AI?

Use the model ID qwen3-8b-fp8 when calling Novita AI's API.

Who created Qwen3-8B?

Qwen3-8B was created by Alibaba as part of the Qwen3 model family.

Is Qwen3-8B open source?

Qwen3-8B is open source under Apache 2.0 according to the seed data.

Get Started