What is Qwen3 used for?

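Qwen3 models are used for reasoning, agent workflows with tool use, structured outputs, and vision and multimodal work. Capability coverage varies by variant: structured outputs are listed for 8 of the 19 models, while vision and multimodal input are listed for only one, so check the specifications table for the variant you plan to use.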

How does Qwen3 compare to Tongyi DeepResearch?

Both families are from Alibaba, and Tongyi DeepResearch is the closest related family to check when selecting among adjacent models. Qwen3 covers vision and multimodal work through Qwen3-Max, has 19 listed variants, and reaches up to 1m context, while Tongyi DeepResearch reaches up to 131k context. Compare the specs and pricing tables before choosing a production model.

Which Qwen3 model should I use?

For the lowest listed input price, start with Qwen3-8B through Novita AI at $0.035/1M input tokens. For the broadest listed capability set, evaluate Qwen3-Max, which has 262k context and supports tool use, function calling, structured outputs, and multimodal inputs.

Qwen3 Models by Alibaba

Alibaba, Apache 2.0, Open source

19 models, 2025, up to 1m ctx, from $0.035/1M input

Details

Researcher: Alibaba

License: Apache 2.0 (OSI-approved)

Commercial use: permitted

Models: 19

Released: 2025

Max context: 1m

Capabilities

Vision: 1 of 19 models

Multimodal: 1 of 19 models

Reasoning: 2 of 19 models

Function Calling: 2 of 19 models

Tool Use: 2 of 19 models

Structured Outputs: 8 of 19 models

About

Qwen3 is a family of 19 AI models by Alibaba, released in 2025.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

Current Qwen3 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Qwen3-105B	Use when the workload needs 128k context, 105B parameters, and tool use.	2025-12	128k context, 105B parameters, tool use	Current
Qwen3-Next-80B-A3B	Use when the workload needs structured outputs.	2025-12	structured outputs	Current
Qwen-Plus	Use when the workload needs 1m context.	2025-11	1m context	Current
Qwen3-8B	Use when the workload needs 128k context, 8B parameters, and structured outputs.	2025-08	128k context, 8B parameters, structured outputs	Current
Qwen3-8B-Instruct	Use when the workload needs 128k context and 8B parameters.	2025-08	128k context, 8B parameters	Current
Qwen3-70B	Use when the workload needs 128k context and 70B parameters.	2025-08	128k context, 70B parameters	Current
Qwen3-70B-Instruct	Use when the workload needs 128k context and 70B parameters.	2025-08	128k context, 70B parameters	Current
Qwen-Flash	Use when the workload needs 1m context.	2025-08	1m context	Current
Qwen3-235B-A22B	Use when the workload needs 128k context, 235B parameters, and structured outputs.	2025-04	128k context, 235B parameters, structured outputs	Current
Qwen3-32B	Use when the workload needs 40k context, 32B parameters, and structured outputs.	2025-04	40k context, 32B parameters, structured outputs	Current
Qwen3-Max	Use when the workload needs 262k context, tool use, and function calling.	2025-04	262k context, tool use, function calling	Current
Qwen3-30B-A3B	Use when the workload needs 128k context, 30B parameters, and structured outputs.	2025-04	128k context, 30B parameters, structured outputs	Current
Qwen3-9B	Use when the workload needs 256k context, 9B parameters, and structured outputs.	2025-04	256k context, 9B parameters, structured outputs	Current
DeepSeek R1 0528 Distill Qwen3-8B	Use when the workload needs 160k context, 8B parameters, and reasoning.	2025-01	160k context, 8B parameters, reasoning	Current
Qwen3-0.6B	Use when the workload needs 40k context and 600M parameters.	2025-01	40k context, 600M parameters	Current
Qwen3-14B	Use when the workload needs 40k context, 14B parameters, and structured outputs.	2025-01	40k context, 14B parameters, structured outputs	Current
Qwen3-1.7B	Use when the workload needs 40k context and 1.7B parameters.	2025-01	40k context, 1.7B parameters	Current
Qwen3-4B	Use when the workload needs 40k context and 4B parameters.	2025-01	40k context, 4B parameters	Current
DeepSeek R1 0528 Qwen3-8B	Use when the workload needs 160k context, 8B parameters, and reasoning.	2025-01	160k context, 8B parameters, reasoning	Current

Release Timeline

5 release groups

2025-12 (2 current)

Qwen3-105B: 128k context, 105B parameters, tool use (Current)
Qwen3-Next-80B-A3B: structured outputs (Current)

2025-11 (1 current)

Qwen-Plus: 1m context (Current)

2025-08 (5 current)

Qwen-Flash: 1m context (Current)
Qwen3-70B: 128k context, 70B parameters (Current)
Qwen3-70B-Instruct: 128k context, 70B parameters (Current)
Qwen3-8B: 128k context, 8B parameters, structured outputs (Current)
Qwen3-8B-Instruct: 128k context, 8B parameters (Current)

2025-04 (5 current)

Qwen3-235B-A22B: 128k context, 235B parameters, structured outputs (Current)
Qwen3-30B-A3B: 128k context, 30B parameters, structured outputs (Current)
Qwen3-32B: 40k context, 32B parameters, structured outputs (Current)
Qwen3-9B: 256k context, 9B parameters, structured outputs (Current)
Qwen3-Max: 262k context, tool use, function calling (Current)

2025-01 (6 current)

DeepSeek R1 0528 Distill Qwen3-8B: 160k context, 8B parameters, reasoning (Current)
DeepSeek R1 0528 Qwen3-8B: 160k context, 8B parameters, reasoning (Current)
Qwen3-0.6B: 40k context, 600M parameters (Current)
Qwen3-1.7B: 40k context, 1.7B parameters (Current)
Qwen3-14B: 40k context, 14B parameters, structured outputs (Current)
Qwen3-4B: 40k context, 4B parameters (Current)

Specifications (19 models)

Qwen3 model specifications comparison
Model	Released	Context	Parameters	Vision	Multimodal	Reasoning	Fn Calling	Tool Use	Structured Outputs
Qwen3-105B	2025-12	128k	105B	No	No	No	Yes	Yes	No
Qwen3-Next-80B-A3B	2025-12	—	80B (3B active)	No	No	No	No	No	Yes
Qwen-Plus	2025-11	1m	—	No	No	No	No	No	No
Qwen3-8B	2025-08	128k	8B	No	No	No	No	No	Yes
Qwen3-8B-Instruct	2025-08	128k	8B	No	No	No	No	No	No
Qwen3-70B	2025-08	128k	70B	No	No	No	No	No	No
Qwen3-70B-Instruct	2025-08	128k	70B	No	No	No	No	No	No
Qwen-Flash	2025-08	1m	—	No	No	No	No	No	No
Qwen3-235B-A22B	2025-04	128k	235B	No	No	No	No	No	Yes
Qwen3-32B	2025-04	40k	32B	No	No	No	No	No	Yes
Qwen3-Max	2025-04	262k	—	Yes	Yes	No	Yes	Yes	Yes
Qwen3-30B-A3B	2025-04	128k	30B	No	No	No	No	No	Yes
Qwen3-9B	2025-04	256k	9B	No	No	No	No	No	Yes
DeepSeek R1 0528 Distill Qwen3-8B	2025-01	160k	8B	No	No	Yes	No	No	No
Qwen3-0.6B	2025-01	40k	0.6B	No	No	No	No	No	No
Qwen3-14B	2025-01	40k	14B	No	No	No	No	No	Yes
Qwen3-1.7B	2025-01	40k	1.7B	No	No	No	No	No	No
Qwen3-4B	2025-01	40k	4B	No	No	No	No	No	No
DeepSeek R1 0528 Qwen3-8B	2025-01	160k	8B	No	No	Yes	No	No	No
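
As a rough illustration of how the specifications table can drive model selection, the sketch below encodes a handful of its rows as Python records and filters them by minimum context window and required capabilities. The `Model` record, the `pick` helper, the capability labels, and the choice of rows are assumptions made for this example; only the values are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    context_k: int            # context window, in thousands of tokens
    capabilities: frozenset   # capability columns marked "Yes" in the table


# Subset of rows copied from the specifications table (illustrative only).
MODELS = [
    Model("Qwen3-105B", 128, frozenset({"function_calling", "tool_use"})),
    Model("Qwen3-8B", 128, frozenset({"structured_outputs"})),
    Model("Qwen3-32B", 40, frozenset({"structured_outputs"})),
    Model("Qwen3-Max", 262, frozenset({
        "vision", "multimodal", "function_calling",
        "tool_use", "structured_outputs",
    })),
    Model("Qwen3-9B", 256, frozenset({"structured_outputs"})),
]


def pick(models, min_context_k=0, required=()):
    """Return names of models that meet the context floor and have every required capability."""
    need = frozenset(required)
    return [
        m.name
        for m in models
        if m.context_k >= min_context_k and need <= m.capabilities
    ]


# Models with at least 128k context that list structured outputs.
print(pick(MODELS, min_context_k=128, required=["structured_outputs"]))
# ['Qwen3-8B', 'Qwen3-Max', 'Qwen3-9B']
```

Filtering on listed capabilities only narrows the shortlist; provider availability and pricing still need to be checked separately.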

Available From (11 providers)

Alibaba Cloud PAI-EAS

AWS Bedrock

Cloudflare Workers AI

Pricing

Qwen3 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Qwen3-8B	Novita AI	$0.035	$0.138	Serverless
Qwen3-9B	DeepInfra	$0.04	$0.2	Serverless
Qwen3-8B	OpenRouter	$0.05	$0.4	Serverless
Qwen3-30B-A3B	Cloudflare Workers AI	$0.051	$0.335	Serverless
Qwen3-14B	OpenRouter	$0.06	$0.24	Serverless
DeepSeek R1 0528 Qwen3-8B	Novita AI	$0.06	$0.09	Serverless
Qwen3-30B-A3B	OpenRouter	$0.08	$0.28	Serverless
Qwen3-32B	OpenRouter	$0.08	$0.24	Serverless
Qwen3-30B-A3B	Vercel AI Gateway	$0.08	$0.29	Serverless
Qwen3-235B-A22B	Novita AI	$0.09	$0.58	Serverless
Qwen3-30B-A3B	Novita AI	$0.09	$0.45	Serverless
Qwen3-Next-80B-A3B	OpenRouter	$0.0975	$0.78	Serverless
Qwen3-0.6B	Fireworks AI	$0.1	$0.1	Serverless
Qwen3-1.7B	Fireworks AI	$0.1	$0.1	Serverless
Qwen3-30B-A3B	AWS Bedrock	$0.1	$0.3	Serverless
Qwen3-32B	Novita AI	$0.1	$0.45	Serverless
Qwen3-14B	NextBit	$0.1	$0.24	Serverless
Qwen3-14B	Vercel AI Gateway	$0.12	$0.24	Serverless
Qwen3-30B-A3B	NextBit	$0.14	$0.55	Serverless
Qwen3-32B	AWS Bedrock	$0.15	$0.62	Serverless
Qwen3-Next-80B-A3B	AWS Bedrock	$0.15	$1.2	Serverless
Qwen3-Next-80B-A3B	Novita AI	$0.15	$1.5	Serverless
Qwen3-32B	Vercel AI Gateway	$0.16	$0.64	Serverless
DeepSeek R1 0528 Distill Qwen3-8B	Fireworks AI	$0.2	$0.2	Serverless
Qwen3-14B	Fireworks AI	$0.2	$0.2	Serverless
Qwen3-4B	Fireworks AI	$0.2	$0.2	Serverless
Qwen3-8B	Fireworks AI	$0.2	$0.2	Serverless
DeepSeek R1 0528 Qwen3-8B	Fireworks AI	$0.2	$0.2	Serverless
Qwen-Flash	Alibaba Cloud PAI-EAS	$0.25	$2	Serverless
Qwen3-32B	GroqCloud	$0.29	$0.59	Serverless
Qwen3-235B-A22B	AWS Bedrock	$0.4	$1.2	Serverless
Qwen3-235B-A22B	OpenRouter	$0.455	$1.82	Serverless
Qwen3-30B-A3B	Fireworks AI	$0.5	$0.5	Serverless
Qwen3-Max	OpenRouter	$0.78	$3.9	Serverless
Qwen3-32B	Fireworks AI	$0.9	$0.9	Serverless
Qwen3-235B-A22B	Fireworks AI	$1.2	$1.2	Serverless
Qwen-Plus	Alibaba Cloud PAI-EAS	$1.2	$3.6	Serverless
Qwen3-Max	Vercel AI Gateway	$1.2	$6	Serverless
Qwen3-Max	Novita AI	$2.11	$8.45	Serverless
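
Because input and output tokens are priced separately, the cheapest provider for a model depends on the token mix of the workload. The sketch below is a minimal cost estimate under that assumption; the `PRICES` mapping holds a few rows copied from the table above, and `estimate_cost` and `cheapest_provider` are hypothetical helper names, not part of any provider SDK.

```python
# (model, provider) -> (input USD per 1M tokens, output USD per 1M tokens)
# Rows copied from the pricing table; this is a subset for illustration.
PRICES = {
    ("Qwen3-8B", "Novita AI"): (0.035, 0.138),
    ("Qwen3-8B", "OpenRouter"): (0.05, 0.4),
    ("Qwen3-8B", "Fireworks AI"): (0.2, 0.2),
    ("Qwen3-Max", "OpenRouter"): (0.78, 3.9),
}


def estimate_cost(model, provider, input_tokens, output_tokens):
    """Estimated USD cost for a workload at the listed per-1M-token prices."""
    input_price, output_price = PRICES[(model, provider)]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(model, input_tokens, output_tokens):
    """Provider with the lowest estimated cost for this model and token mix."""
    candidates = [
        (estimate_cost(m, p, input_tokens, output_tokens), p)
        for (m, p) in PRICES
        if m == model
    ]
    return min(candidates)[1]


# 10M input tokens and 2M output tokens on Qwen3-8B via Novita AI.
cost = estimate_cost("Qwen3-8B", "Novita AI", 10_000_000, 2_000_000)
print(f"${cost:.3f}")  # $0.626
print(cheapest_provider("Qwen3-8B", 10_000_000, 2_000_000))  # Novita AI
```

Listed prices change over time and may not include provider-specific fees or rate limits, so treat the result as an estimate rather than a quote.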

Models (19)

Qwen3-105B: 2025-12, 128k context, 105B parameters, Open Source
Qwen3-Next-80B-A3B: 2025-12, 80B parameters (3B active), 4 providers, Open Source
Qwen-Plus: 2025-11, 1m context, 1 provider, Open Source
Qwen3-8B: 2025-08, 128k context, 8B parameters, 3 providers
Qwen3-8B-Instruct
Qwen3-70B
Qwen3-70B-Instruct
Qwen-Flash
Qwen3-235B-A22B: 2025-04, 128k context, 235B parameters, 5 providers, Open Source
Qwen3-32B: 2025-04, 40k context, 32B parameters, 6 providers, Open Source
Qwen3-Max: 2025-04, 262k context, 3 providers, Multimodal, Open Source
Qwen3-30B-A3B: 2025-04, 128k context, 30B parameters, 7 providers, Open Source
Qwen3-9B: 2025-04, 256k context, 9B parameters, 1 provider, Open Source
DeepSeek R1 0528 Distill Qwen3-8B: 2025-01, 160k context, 8B parameters, 1 provider, Reasoning, Open Source
Qwen3-0.6B: 2025-01, 40k context, 0.6B parameters, 1 provider, Open Source
Qwen3-14B: 2025-01, 40k context, 14B parameters, 4 providers, Open Source
Qwen3-1.7B: 2025-01, 40k context, 1.7B parameters, 1 provider, Open Source
Qwen3-4B: 2025-01, 40k context, 4B parameters, 1 provider, Open Source

DeepSeek R1 0528 Qwen3-8B: 2025-01, 160k context, 8B parameters, 2 providers, Reasoning, Open Source
