GLM-5 Models by Zhipu AI
6 models · 2026 · Up to 262K context · From $0.6/1M input tokens
About
GLM-5 is a family of 6 AI models by Zhipu AI, released in 2026.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| GLM-5.1 | Use when the workload needs 200K context, reasoning, and tool use. | 2026-04 | 200K context, reasoning, tool use | Current |
| GLM-5V-Turbo | Use when the workload needs 200K context, reasoning, and tool use. | 2026-04 | 200K context, reasoning, tool use | Current |
| GLM-5 Turbo | Use when the workload needs 200K context, reasoning, and tool use. | 2026-03 | 200K context, reasoning, tool use | Current |
| GLM-5 9B | Use when the workload needs 262K context, a 9B-parameter model, and reasoning. | 2026-02 | 262K context, 9B parameters, reasoning | Current |
| GLM-5 | Use when the workload needs 200K context, reasoning, and tool use. | 2026-02 | 200K context, reasoning, tool use | Current |
| Zhipu GLM-5 | Use when the workload needs 203K context. | 2026-02 | 203K context | Current |
Release Timeline
3 release groups
2026-04 (2 current)
- GLM-5.1: Current; 200K context, reasoning, tool use
- GLM-5V-Turbo: Current; 200K context, reasoning, tool use
2026-03 (1 current)
- GLM-5 Turbo: Current; 200K context, reasoning, tool use
2026-02 (3 current)
- GLM-5: Current; 200K context, reasoning, tool use
- GLM-5 9B: Current; 262K context, 9B parameters, reasoning
- Zhipu GLM-5: Current; 203K context
Specifications (6 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Reasoning | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|---|---|
| GLM-5.1 | 2026-04 | 200K | 754B total, 40B active | No | No | Yes | Yes | Yes | Yes | Yes |
| GLM-5V-Turbo | 2026-04 | 200K | 744B total, 40B active | Yes | Yes | Yes | Yes | Yes | Yes | No |
| GLM-5 Turbo | 2026-03 | 200K | 744B total, 40B active | No | No | Yes | Yes | Yes | Yes | No |
| GLM-5 9B | 2026-02 | 262K | 9B | No | No | Yes | Yes | Yes | No | No |
| GLM-5 | 2026-02 | 200K | 744B total, 40B active | No | No | Yes | Yes | Yes | Yes | No |
| Zhipu GLM-5 | 2026-02 | 203K | 744B total, 40B active | No | No | No | No | No | No | No |
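A minimal sketch of how the specifications table could be filtered by required capabilities. The `ModelSpec` class and `matching_models` function are hypothetical names introduced for illustration, not part of any Zhipu AI or provider SDK; the values are transcribed from the table above and cover only a subset of its columns.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """One row of the specifications table (hypothetical helper type)."""
    name: str
    context_k: int            # context window, in thousands of tokens
    vision: bool
    reasoning: bool
    tool_use: bool
    structured_outputs: bool


# Values transcribed from the specifications table.
SPECS = [
    ModelSpec("GLM-5.1", 200, False, True, True, True),
    ModelSpec("GLM-5V-Turbo", 200, True, True, True, True),
    ModelSpec("GLM-5 Turbo", 200, False, True, True, True),
    ModelSpec("GLM-5 9B", 262, False, True, True, False),
    ModelSpec("GLM-5", 200, False, True, True, True),
    ModelSpec("Zhipu GLM-5", 203, False, False, False, False),
]


def matching_models(specs, min_context_k=0, **required):
    """Return names of models that meet the context floor and every required flag."""
    return [
        s.name
        for s in specs
        if s.context_k >= min_context_k
        and all(getattr(s, flag) == want for flag, want in required.items())
    ]


# Usage: which listed models accept image input?
vision_models = matching_models(SPECS, vision=True)
```

Keyword arguments map directly to column names, so adding another column from the table only requires a new field on `ModelSpec`.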
Available From (6 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| GLM-5 | OpenRouter | $0.6 | $2.08 | Serverless |
| GLM-5 | Fireworks AI | $1 | $3.2 | Serverless |
| GLM-5 | Together AI | $1 | $3.2 | Serverless |
| GLM-5 | GCP Vertex AI | $1 | $3.2 | Serverless |
| GLM-5.1 | OpenRouter | $1.05 | $3.5 | Serverless |
| GLM-5V-Turbo | OpenRouter | $1.2 | $4 | Serverless |
| GLM-5 Turbo | OpenRouter | $1.2 | $4 | Serverless |
| GLM-5.1 | Z.ai | $1.4 | $4.4 | Serverless |
| GLM-5.1 | Fireworks AI | $1.4 | $4.4 | Serverless |
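As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The `PRICES` mapping and `estimate_cost` function are hypothetical names; the rates are copied from a few rows of the pricing table and may not reflect current provider pricing.

```python
# (model, provider) -> (input USD per 1M tokens, output USD per 1M tokens)
# A subset of rows copied from the pricing table.
PRICES = {
    ("GLM-5", "OpenRouter"): (0.60, 2.08),
    ("GLM-5", "Fireworks AI"): (1.00, 3.20),
    ("GLM-5.1", "OpenRouter"): (1.05, 3.50),
    ("GLM-5.1", "Z.ai"): (1.40, 4.40),
}


def estimate_cost(model, provider, input_tokens, output_tokens):
    """Estimate the USD cost of one request from its token counts."""
    in_price, out_price = PRICES[(model, provider)]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Usage: 50,000 input tokens and 2,000 output tokens on GLM-5 via OpenRouter.
cost = estimate_cost("GLM-5", "OpenRouter", 50_000, 2_000)
print(f"${cost:.4f}")
```

For that example the estimate works out to roughly $0.034 per request; output tokens cost about 3.5 times as much as input tokens at these rates, so long completions dominate the bill faster than long prompts.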
Frequently Asked Questions
- What is GLM-5 used for?
- GLM-5 models are used for reasoning, agent workflows, and tool use; GLM-5V-Turbo adds vision and multimodal input. The family description and the capabilities listed in the specifications table point to those workloads as the best fit.
- How does GLM-5 compare to CogView?
- Both families are from Zhipu AI. GLM-5 covers reasoning and tool use, with vision and multimodal input in GLM-5V-Turbo, while CogView is the closest related family to check for adjacent model selection. GLM-5 has 6 listed variants and reaches up to 262K context, so compare the specs and pricing tables before choosing a production model.
- Which GLM-5 model should I use?
- For the lowest listed input price, start with GLM-5 through OpenRouter at $0.6/1M input tokens. For the newest multimodal option, evaluate GLM-5V-Turbo (2026-04), which lists 200K context, reasoning, tool use, function calling, structured outputs, and multimodal inputs.
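The "lowest listed input price" recommendation above can be reproduced with a small lookup over the pricing rows. This is a sketch only: `LISTINGS` and `cheapest_listing` are hypothetical names, and the rows are copied from the pricing table on this page.

```python
# (model, provider, input USD per 1M tokens, output USD per 1M tokens)
LISTINGS = [
    ("GLM-5", "OpenRouter", 0.60, 2.08),
    ("GLM-5", "Fireworks AI", 1.00, 3.20),
    ("GLM-5", "Together AI", 1.00, 3.20),
    ("GLM-5", "GCP Vertex AI", 1.00, 3.20),
    ("GLM-5.1", "OpenRouter", 1.05, 3.50),
    ("GLM-5V-Turbo", "OpenRouter", 1.20, 4.00),
    ("GLM-5 Turbo", "OpenRouter", 1.20, 4.00),
    ("GLM-5.1", "Z.ai", 1.40, 4.40),
    ("GLM-5.1", "Fireworks AI", 1.40, 4.40),
]


def cheapest_listing(listings, model=None):
    """Return the row with the lowest input price, optionally for one model only."""
    candidates = [row for row in listings if model is None or row[0] == model]
    if not candidates:
        raise ValueError(f"no listings found for model {model!r}")
    # min() keeps the first row on ties, so table order breaks ties.
    return min(candidates, key=lambda row: row[2])


# Usage: cheapest listing overall, then cheapest listing for GLM-5.1.
overall = cheapest_listing(LISTINGS)
for_glm_5_1 = cheapest_listing(LISTINGS, model="GLM-5.1")
```

Ranking by input price alone matches the FAQ wording; workloads with long completions may prefer to rank by output price or by a blended estimate instead.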
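Models (6)
| Model | Released | Context | Parameters | Providers | Tags |
|---|---|---|---|---|---|
| GLM-5.1 | 2026-04 | 200K | 754B total, 40B active | 3 | Reasoning, Open Source |
| GLM-5V-Turbo | 2026-04 | 200K | 744B total, 40B active | 1 | Multimodal, Reasoning |
| GLM-5 Turbo | 2026-03 | 200K | 744B total, 40B active | 1 | Reasoning |
| GLM-5 9B | 2026-02 | 262K | 9B | not listed | Reasoning, Open Source |
| GLM-5 | 2026-02 | 200K | 744B total, 40B active | 5 | Reasoning |
| Zhipu GLM-5 | 2026-02 | 203K | 744B total, 40B active | not listed | not listed |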





