LLM Reference

GLM Image Models by Zhipu AI

Zhipu AIApache 2.0Open source
1 model2026

Details

ResearcherZhipu AI
LicenseApache 2.0OSI-approved
Commercial useCommercial use: permitted
Models1
Released2026

Capabilities

MultimodalAll models

Links

Website

About

Zhipu AI's (Z.ai) flagship image generation family using a hybrid 16B architecture: a 9B autoregressive language model combined with a 7B diffusion decoder. Pioneered precise Chinese and English text rendering in AI-generated images. The first open-source multimodal image model trained entirely on Huawei Ascend hardware.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view
GLM ImageCurrent

Use when the workload needs image generation, 16B parameters, and multimodal inputs.

2026-01image generation16B parametersmultimodal inputs

Release Timeline

1 release group
2026-01
1 current
GLM Image
image generation16B parametersmultimodal inputs
Current

Specifications(1 models)

GLM Image model specifications comparison
ModelReleasedParametersMultimodal
GLM Image2026-0116BYes

Frequently Asked Questions

What is GLM Image used for?
GLM Image is used for image generation, vision and multimodal work, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does GLM Image compare to GLM-5?
GLM Image by Zhipu AI is strongest where you need image generation, while GLM-5 by Zhipu AI is the closest related family to check for coding. GLM Image has 1 listed variant, while GLM-5 reaches up to 1m context, so compare the specs and pricing tables before choosing a production model.
Which GLM Image model should I use?
If price is the main constraint, use the pricing table first because GLM Image does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate GLM Image with multimodal inputs.