Qwen Image Models by Alibaba
2 models2025
Details
ResearcherAlibaba
LicenseApache 2.0OSI-approved
Commercial useCommercial use: permitted
Models2
Released2025
Capabilities
MultimodalAll models
About
Alibaba's Qwen Image family of text-to-image generation models built on Multimodal Diffusion Transformer (MMDiT) architecture. Achieves commercial-grade Chinese and English text rendering. Open-source on HuggingFace (Qwen/Qwen-Image), part of Alibaba's Tongyi/Qwen AI ecosystem.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
2 in view
Qwen ImageCurrent
Use when the workload needs image generation, 20B parameters, and multimodal inputs.
2025-08image generation20B parametersmultimodal inputs
Qwen Image MaxCurrent
Use when the workload needs image generation and multimodal inputs.
2025-08image generationmultimodal inputs
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Qwen Image | Use when the workload needs image generation, 20B parameters, and multimodal inputs. | 2025-08 | image generation20B parametersmultimodal inputs | Current |
| Qwen Image Max | Use when the workload needs image generation and multimodal inputs. | 2025-08 | image generationmultimodal inputs | Current |
Release Timeline
1 release group2025-08
2 current
Qwen Image
Currentimage generation20B parametersmultimodal inputs
Qwen Image Max
Currentimage generationmultimodal inputs
Specifications(2 models)
| Model | Released | Parameters | Multimodal |
|---|---|---|---|
| Qwen Image | 2025-08 | 20B | Yes |
| Qwen Image Max | 2025-08 | — | Yes |
Frequently Asked Questions
- What is Qwen Image used for?
- Qwen Image is used for image generation and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
- How does Qwen Image compare to Tongyi DeepResearch?
- Qwen Image by Alibaba is strongest where you need image generation, while Tongyi DeepResearch by Alibaba is the closest related family to check for adjacent model selection. Qwen Image has 2 listed variants, while Tongyi DeepResearch reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
- Which Qwen Image model should I use?
- If price is the main constraint, use the pricing table first because Qwen Image does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Qwen Image with multimodal inputs.





