LLM Reference

Qwen3 Models by Alibaba

19 models2025Up to 1m ctxFrom $0.035/1M input

About

Qwen3 is a family of 19 AI models by Alibaba, released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

19 in view
Qwen3-105BCurrent

Use when the workload needs 128k context, 105B parameters, and tool use.

2025-12128k context105B parameterstool use

Use when the workload needs structured outputs.

2025-12structured outputs
Qwen-PlusCurrent

Use when the workload needs 1m context.

2025-111m context
Qwen3-8BCurrent

Use when the workload needs 128k context, 8B parameters, and structured outputs.

2025-08128k context8B parametersstructured outputs

Use when the workload needs 128k context and 8B parameters.

2025-08128k context8B parameters
Qwen3-70BCurrent

Use when the workload needs 128k context and 70B parameters.

2025-08128k context70B parameters

Use when the workload needs 128k context and 70B parameters.

2025-08128k context70B parameters
Qwen-FlashCurrent

Use when the workload needs 1m context.

2025-081m context

Use when the workload needs 128k context, 235B parameters, and structured outputs.

2025-04128k context235B parametersstructured outputs
Qwen3-32BCurrent

Use when the workload needs 40k context, 32B parameters, and structured outputs.

2025-0440k context32B parametersstructured outputs
Qwen3-MaxCurrent

Use when the workload needs 262k context, tool use, and function calling.

2025-04262k contexttool usefunction calling

Use when the workload needs 128k context, 30B parameters, and structured outputs.

2025-04128k context30B parametersstructured outputs
Qwen3-9BCurrent

Use when the workload needs 256k context, 9B parameters, and structured outputs.

2025-04256k context9B parametersstructured outputs

Use when the workload needs 160k context, 8B parameters, and reasoning.

2025-01160k context8B parametersreasoning
Qwen3-0.6BCurrent

Use when the workload needs 40k context and 600M parameters.

2025-0140k context600M parameters
Qwen3-14BCurrent

Use when the workload needs 40k context, 14B parameters, and structured outputs.

2025-0140k context14B parametersstructured outputs
Qwen3-1.7BCurrent

Use when the workload needs 40k context and 1.7B parameters.

2025-0140k context1.7B parameters
Qwen3-4BCurrent

Use when the workload needs 40k context and 4B parameters.

2025-0140k context4B parameters

Use when the workload needs 160k context, 671B parameters, and reasoning.

2025-01160k context671B parametersreasoning

Release Timeline

5 release groups
2025-12
2 current
Qwen3-105B
128k context105B parameterstool use
Current
Qwen3-Next-80B-A3B
structured outputs
Current
2025-11
1 current
Qwen-Plus
1m context
Current
2025-08
5 current
Qwen-Flash
1m context
Current
Qwen3-70B
128k context70B parameters
Current
Qwen3-70B-Instruct
128k context70B parameters
Current
Qwen3-8B
128k context8B parametersstructured outputs
Current
Qwen3-8B-Instruct
128k context8B parameters
Current
2025-04
5 current
Qwen3-235B-A22B
128k context235B parametersstructured outputs
Current
Qwen3-30B-A3B
128k context30B parametersstructured outputs
Current
Qwen3-32B
40k context32B parametersstructured outputs
Current
Qwen3-9B
256k context9B parametersstructured outputs
Current
Qwen3-Max
262k contexttool usefunction calling
Current
2025-01
6 current
DeepSeek R1 0528 Distill Qwen3-8B
160k context8B parametersreasoning
Current
DeepSeek R1 0528 Qwen3-8B
160k context671B parametersreasoning
Current
Qwen3-0.6B
40k context600M parameters
Current
Qwen3-1.7B
40k context1.7B parameters
Current
Qwen3-14B
40k context14B parametersstructured outputs
Current
Qwen3-4B
40k context4B parameters
Current

Specifications(19 models)

Qwen3 model specifications comparison
ModelReleasedContextParametersVisionMultimodalReasoningFn CallingTool UseStructured Outputs
Qwen3-105B2025-12128k105BNoNoNoYesYesNo
Qwen3-Next-80B-A3B2025-1280B (3B active)NoNoNoNoNoYes
Qwen-Plus2025-111mNoNoNoNoNoNo
Qwen3-8B2025-08128k8BNoNoNoNoNoYes
Qwen3-8B-Instruct2025-08128k8BNoNoNoNoNoNo
Qwen3-70B2025-08128k70BNoNoNoNoNoNo
Qwen3-70B-Instruct2025-08128k70BNoNoNoNoNoNo
Qwen-Flash2025-081mNoNoNoNoNoNo
Qwen3-235B-A22B2025-04128k235BNoNoNoNoNoYes
Qwen3-32B2025-0440k32BNoNoNoNoNoYes
Qwen3-Max2025-04262kYesYesNoYesYesYes
Qwen3-30B-A3B2025-04128k30BNoNoNoNoNoYes
Qwen3-9B2025-04256k9BNoNoNoNoNoYes
DeepSeek R1 0528 Distill Qwen3-8B2025-01160k8BNoNoYesNoNoNo
Qwen3-0.6B2025-0140k0.6BNoNoNoNoNoNo
Qwen3-14B2025-0140k14BNoNoNoNoNoYes
Qwen3-1.7B2025-0140k1.7BNoNoNoNoNoNo
Qwen3-4B2025-0140k4BNoNoNoNoNoNo
DeepSeek R1 0528 Qwen3-8B2025-01160k671BNoNoYesNoNoNo

Available From(9 providers)

Pricing

Qwen3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Qwen3-8BNovita AI$0.035$0.138Serverless
Qwen3-9BDeepInfra$0.04$0.2Serverless
Qwen3-8BOpenRouter$0.05$0.4Serverless
Qwen3-14BOpenRouter$0.06$0.24Serverless
DeepSeek R1 0528 Qwen3-8BNovita AI$0.06$0.09Serverless
Qwen3-30B-A3BOpenRouter$0.08$0.28Serverless
Qwen3-32BOpenRouter$0.08$0.24Serverless
Qwen3-30B-A3BVercel AI Gateway$0.08$0.29Serverless
Qwen3-235B-A22BNovita AI$0.09$0.58Serverless
Qwen3-30B-A3BNovita AI$0.09$0.45Serverless
Qwen3-Next-80B-A3BOpenRouter$0.0975$0.78Serverless
Qwen3-0.6BFireworks AI$0.1$0.1Serverless
Qwen3-1.7BFireworks AI$0.1$0.1Serverless
Qwen3-30B-A3BAWS Bedrock$0.1$0.3Serverless
Qwen3-32BNovita AI$0.1$0.45Serverless
Qwen3-14BVercel AI Gateway$0.12$0.24Serverless
Qwen3-32BAWS Bedrock$0.15$0.62Serverless
Qwen3-Next-80B-A3BAWS Bedrock$0.15$1.2Serverless
Qwen3-Next-80B-A3BNovita AI$0.15$1.5Serverless
Qwen3-Next-80B-A3BNovita AI$0.15$1.5Serverless
Qwen3-32BVercel AI Gateway$0.16$0.64Serverless
DeepSeek R1 0528 Distill Qwen3-8BFireworks AI$0.2$0.2Serverless
Qwen3-14BFireworks AI$0.2$0.2Serverless
Qwen3-4BFireworks AI$0.2$0.2Serverless
Qwen3-8BFireworks AI$0.2$0.2Serverless
DeepSeek R1 0528 Qwen3-8BFireworks AI$0.2$0.2Serverless
Qwen3-235B-A22BNovita AI$0.2$0.8Serverless
Qwen-FlashAlibaba Cloud PAI-EAS$0.25$2Serverless
Qwen3-32BGroqCloud$0.29$0.59Serverless
Qwen3-235B-A22BNovita AI$0.3$3Serverless
Qwen3-235B-A22BAWS Bedrock$0.4$1.2Serverless
Qwen3-235B-A22BOpenRouter$0.455$1.82Serverless
Qwen3-30B-A3BFireworks AI$0.5$0.5Serverless
Qwen3-MaxOpenRouter$0.78$3.9Serverless
Qwen3-32BFireworks AI$0.9$0.9Serverless
Qwen3-235B-A22BFireworks AI$1.2$1.2Serverless
Qwen-PlusAlibaba Cloud PAI-EAS$1.2$3.6Serverless
Qwen3-MaxVercel AI Gateway$1.2$6Serverless
Qwen3-MaxNovita AI$2.11$8.45Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is Qwen3 used for?
Qwen3 is used for vision and multimodal work, reasoning, and agent workflows and tool use. The family description and listed model capabilities point to those workloads as the best fit.
How does Qwen3 compare to Tongyi DeepResearch?
Qwen3 by Alibaba is strongest where you need vision and multimodal work, while Tongyi DeepResearch by Alibaba is the closest related family to check for adjacent model selection. Qwen3 has 19 listed variants and reaches up to 1m context, while Tongyi DeepResearch reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which Qwen3 model should I use?
For the lowest listed input price, start with Qwen3-8B through Novita AI at $0.035/1M input tokens. For the most capable/latest local choice, evaluate Qwen3-Max with 262k context and tool use, function calling, structured outputs, and multimodal inputs.

Models(19)