LLM Reference

QwQ Models by Alibaba

1 model2025Up to 128k ctxFrom $0.66/1M input

Details

ResearcherAlibaba
Models1
Released2025
Max context128k

Capabilities

ReasoningAll models

About

QwQ is a family of 1 AI model by Alibaba, released in 2025.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view
QwQ 32BCurrent

Use when the workload needs reasoning, 128k context, and 32.5B parameters.

2025-03reasoning128k context32.5B parameters

Release Timeline

1 release group
2025-03
1 current
QwQ 32B
reasoning128k context32.5B parameters
Current

Specifications(1 models)

QwQ model specifications comparison
ModelReleasedContextParametersReasoning
QwQ 32B2025-03128k32.5BYes

Available From(2 providers)

Pricing

QwQ model pricing by provider
ModelProviderInput / 1MOutput / 1MType
QwQ 32BCloudflare Workers AI$0.66$1Serverless

Frequently Asked Questions

What is QwQ used for?
QwQ is used for reasoning, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does QwQ compare to Tongyi DeepResearch?
QwQ by Alibaba is strongest where you need reasoning, while Tongyi DeepResearch by Alibaba is the closest related family to check for adjacent model selection. QwQ has 1 listed variant and reaches up to 128k context, while Tongyi DeepResearch reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which QwQ model should I use?
For the lowest listed input price, start with QwQ 32B through Cloudflare Workers AI at $0.66/1M input tokens. For the most capable/latest local choice, evaluate QwQ 32B with 128k context and reasoning.