LLM Reference

GLM Image Models by Zhipu AI

Zhipu AI · Apache 2.0
1 model · 2026

About

Zhipu AI's (Z.ai) flagship image generation family uses a hybrid 16B architecture: a 9B autoregressive language model combined with a 7B diffusion decoder. The family pioneered precise Chinese and English text rendering in AI-generated images and is the first open-source multimodal image model trained entirely on Huawei Ascend hardware.
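The parameter split described above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the dictionary keys and function names are hypothetical labels chosen for this example, and the sizes are the rounded figures quoted in the description, not exact parameter counts.

```python
# Component sizes in billions of parameters, as quoted in the description.
# The key names are illustrative labels, not official component names.
COMPONENTS_B = {
    "autoregressive_language_model": 9,
    "diffusion_decoder": 7,
}


def total_params_b(components):
    """Sum the component sizes (billions of parameters)."""
    return sum(components.values())


def share(components):
    """Return each component's fraction of the total parameter count."""
    total = total_params_b(components)
    return {name: size / total for name, size in components.items()}


if __name__ == "__main__":
    print(total_params_b(COMPONENTS_B))  # 16, matching the 16B headline figure
    for name, fraction in share(COMPONENTS_B).items():
        print(f"{name}: {fraction:.1%}")
```

Under these rounded figures, the language model accounts for a little over half of the total and the diffusion decoder for the remainder.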

Specifications (1 model)

GLM Image model specifications comparison
Model      Released  Parameters  Multimodal
GLM Image  2026-01   16B         Yes

Frequently Asked Questions

What is GLM Image used for?
GLM Image is used for image generation, vision and multimodal work, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does GLM Image compare to GLM-5?
Both families come from Zhipu AI. GLM Image is the stronger choice when you need image generation, while GLM-5 is the closest related family to evaluate for vision and multimodal work. GLM Image has 1 listed variant and GLM-5 reaches up to 262K context, so compare the specs and pricing tables before choosing a production model.
Which GLM Image model should I use?
If price is the main constraint, check the pricing table first, but note that GLM Image does not have complete provider pricing in the local data. GLM Image is the only listed variant, so it is also the most capable and latest local choice; evaluate it with multimodal inputs.

Models (1)