LLM Reference

Qwen Models by Alibaba

5 models | Released 2023 | Up to 33K context | From $0.05 per 1M input tokens

About

Qwen is a family of 5 AI models by Alibaba, released in 2023.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Qwen-72B (Current)
Use when the workload needs 72B parameters.
Released 2023-11 | 72B parameters

Qwen-1.8B (Current)
Use when the workload needs 4K context and 1.8B parameters.
Released 2023-11 | 4K context | 1.8B parameters

Fireworks Qwen-72B-Chat (Current)
Use when the workload needs 33K context and 72B parameters.
Released 2023-11 | 33K context | 72B parameters

Qwen-14B (Current)
Use when the workload needs 32K context and 14B parameters.
Released 2023-09 | 32K context | 14B parameters

Qwen-7B (Current)
Use when the workload needs 8K context and 7B parameters.
Released 2023-08 | 8K context | 7B parameters

Release Timeline

3 release groups
2023-11 (3 current)
Fireworks Qwen-72B-Chat | 33K context | 72B parameters | Current
Qwen-1.8B | 4K context | 1.8B parameters | Current
Qwen-72B | 72B parameters | Current

2023-09 (1 current)
Qwen-14B | 32K context | 14B parameters | Current

2023-08 (1 current)
Qwen-7B | 8K context | 7B parameters | Current

Specifications (5 models)

Qwen model specifications comparison
Model | Released | Context | Parameters
Qwen-72B | 2023-11 | not listed | 72B
Qwen-1.8B | 2023-11 | 4K | 1.8B
Fireworks Qwen-72B-Chat | 2023-11 | 33K | 72B
Qwen-14B | 2023-09 | 32K | 14B
Qwen-7B | 2023-08 | 8K | 7B
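
The use-when guidance can be read as a simple filter over the specifications listed above. The sketch below is illustrative only: the `Variant` type and the `smallest_variant_for` helper are hypothetical names, context sizes are taken as the rounded "K" figures from the table, and Qwen-72B is skipped by the filter because no context size is listed for it.

```python
from typing import NamedTuple, Optional


class Variant(NamedTuple):
    name: str
    context_k: Optional[int]  # context in thousands of tokens; None if not listed
    params_b: float           # parameter count in billions


# Values copied from the specifications table.
VARIANTS = [
    Variant("Qwen-72B", None, 72),
    Variant("Qwen-1.8B", 4, 1.8),
    Variant("Fireworks Qwen-72B-Chat", 33, 72),
    Variant("Qwen-14B", 32, 14),
    Variant("Qwen-7B", 8, 7),
]


def smallest_variant_for(min_context_k: int) -> Optional[Variant]:
    """Return the smallest-parameter variant whose listed context is large enough."""
    candidates = [
        v for v in VARIANTS
        if v.context_k is not None and v.context_k >= min_context_k
    ]
    # default=None covers the case where no listed variant has enough context.
    return min(candidates, key=lambda v: v.params_b, default=None)


if __name__ == "__main__":
    for need in (4, 8, 16, 33, 64):
        match = smallest_variant_for(need)
        print(need, "->", match.name if match else "no listed variant")
```

Picking the smallest adequate model is only one possible policy. Quality requirements may justify a larger variant even when a smaller one has enough context.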

Available From (2 providers)

Replicate API, Fireworks AI

Pricing

Qwen model pricing by provider (USD per 1M tokens)
Model | Provider | Input / 1M | Output / 1M | Type
Qwen-7B | Replicate API | $0.05 | $0.25 | Serverless
Qwen-14B | Replicate API | $0.10 | $0.50 | Serverless
Qwen-14B | Fireworks AI | $0.20 | $0.20 | Provisioned
Fireworks Qwen-72B-Chat | Fireworks AI | $0.80 | $0.80 | Serverless
Qwen-72B | Fireworks AI | $0.90 | $0.90 | Provisioned
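
To compare rows, it can help to turn the per-1M prices into a cost for a concrete workload. The following is a minimal sketch that assumes billing is purely per token at the listed rates; provisioned deployments may be billed differently in practice. The `PRICES` mapping and the `estimate_cost` helper are hypothetical names introduced for illustration.

```python
# (model, provider) -> (input USD per 1M tokens, output USD per 1M tokens),
# copied from the pricing table.
PRICES = {
    ("Qwen-7B", "Replicate API"): (0.05, 0.25),
    ("Qwen-14B", "Replicate API"): (0.10, 0.50),
    ("Qwen-14B", "Fireworks AI"): (0.20, 0.20),
    ("Fireworks Qwen-72B-Chat", "Fireworks AI"): (0.80, 0.80),
    ("Qwen-72B", "Fireworks AI"): (0.90, 0.90),
}


def estimate_cost(model: str, provider: str,
                  input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD, assuming simple per-token billing at the listed rates."""
    in_price, out_price = PRICES[(model, provider)]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 0.5M output tokens on each listed row.
    for (model, provider) in PRICES:
        cost = estimate_cost(model, provider, 2_000_000, 500_000)
        print(f"{model} via {provider}: ${cost:.3f}")
```

Because input and output prices differ on Replicate API but are equal on Fireworks AI, the cheapest row for a given model can depend on the ratio of input to output tokens in the workload.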

Frequently Asked Questions

What is Qwen used for?
Qwen is used primarily for coding. The family description and the listed model capabilities point to coding workloads as the best fit.

How does Qwen compare to Tongyi DeepResearch?
Qwen is strongest for coding workloads. Tongyi DeepResearch, also by Alibaba, is the closest related family and is worth checking when choosing among adjacent models. Qwen has 5 listed variants and reaches up to 33K context, while Tongyi DeepResearch reaches up to 131K context. Compare the specifications and pricing tables before choosing a production model.

Which Qwen model should I use?
For the lowest listed input price, start with Qwen-7B through Replicate API at $0.05 per 1M input tokens. For the most capable and most recent option in this family, evaluate Fireworks Qwen-72B-Chat with 33K context.

Models (5)