GLM-4 Models by Tsinghua Knowledge Engineering Group (THUDM)
About
The GLM-4 family, developed by Zhipu AI and Tsinghua University, represents an evolving series of large language models renowned for their multilingual capabilities and state-of-the-art performance. Building upon previous ChatGLM generations, these models are pre-trained on an extensive dataset of ten trillion tokens across Chinese, English, and 24 other languages. They undergo rigorous multi-stage post-training, involving supervised fine-tuning and reinforcement learning from human feedback, which enables them to rival or surpass GPT-4 on various benchmarks. The series includes versions like GLM-4, GLM-4-Air, and GLM-4-9B, each tailored for different tasks and resource constraints. A notable feature is the GLM-4 All Tools model that can autonomously use web browsers and Python interpreters for complex task completion. Open-source variants, such as GLM-4-9B and its chat-optimized version, along with multimodal models like GLM-4V-9B, which integrates image processing, highlight the family's versatility. Recent advancements include the GLM-4-Voice model, an end-to-end speech model supporting Chinese and English, further extending the boundaries of open-source LLMs 1 3 5 6 7 8.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 200k context, tool use, and function calling.
Use when the workload needs 128k context, tool use, and function calling.
Use when the workload needs 64k context, tool use, and function calling.
Use when the workload needs 128k context and 9B parameters.
Use when the workload needs 128k context, 32B parameters, and structured outputs.
Use when the workload needs 128k context and structured outputs.
Use when the workload needs 128k context and structured outputs.
Use when the workload needs 198k context and structured outputs.
Use when the workload needs 128k context and structured outputs.
Use when the workload needs 198k context and structured outputs.
Use when the workload needs 131k context and 9B parameters.
Use when the workload needs 131k context, 9B parameters, and multimodal inputs.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| GLM 4.7 | Use when the workload needs 200k context, tool use, and function calling. | 2026-03 | 200k contexttool usefunction calling | Current |
| GLM 4.6V | Use when the workload needs 128k context, tool use, and function calling. | 2026-02 | 128k contexttool usefunction calling | Current |
| GLM 4.5V | Use when the workload needs 64k context, tool use, and function calling. | 2026-01 | 64k contexttool usefunction calling | Current |
| GLM-4 Code 9B | Use when the workload needs 128k context and 9B parameters. | 2025-05 | 128k context9B parameters | Current |
| GLM-4 Air 4B | Use when the workload needs 4B parameters. | 2025-03 | 4B parameters | Current |
| GLM-4 32B | Use when the workload needs 128k context, 32B parameters, and structured outputs. | 2025-03 | 128k context32B parametersstructured outputs | Current |
| GLM-4.7 | Use when the workload needs 128k context and structured outputs. | 2025-01 | 128k contextstructured outputs | Current |
| GLM-4.5 | Use when the workload needs 128k context and structured outputs. | 2025-01 | 128k contextstructured outputs | Current |
| GLM-4.7 Flash | Use when the workload needs 198k context and structured outputs. | 2025-01 | 198k contextstructured outputs | Current |
| GLM-4.5-Air | Use when the workload needs 128k context and structured outputs. | 2025-01 | 128k contextstructured outputs | Current |
| GLM-4.6 | Use when the workload needs 198k context and structured outputs. | 2025-01 | 198k contextstructured outputs | Current |
| GLM-4-Extreme | Use when provider availability and model metadata match the workload. | 2024-06 | — | Current |
| GLM-4-Air | Use when the workload needs 128k context. | 2024-06 | 128k context | Current |
| GLM-4-Flash | Use when the workload needs 128k context. | 2024-06 | 128k context | Current |
| GLM-4 9B | Use when the workload needs 131k context and 9B parameters. | 2024-06 | 131k context9B parameters | Current |
| GLM-4V 9B | Use when the workload needs 131k context, 9B parameters, and multimodal inputs. | 2024-06 | 131k context9B parametersmultimodal inputs | Current |
Release Timeline
7 release groupsSpecifications(16 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|---|
| GLM 4.7 | 2026-03 | 200k | 358B (32B active) | No | No | Yes | Yes | Yes | Yes |
| GLM 4.6V | 2026-02 | 128k | 106B (12B active) | Yes | Yes | Yes | Yes | No | No |
| GLM 4.5V | 2026-01 | 64k | 106B (12B active) | Yes | Yes | Yes | Yes | No | No |
| GLM-4 Code 9B | 2025-05 | 128k | 9B | No | No | No | No | No | No |
| GLM-4 Air 4B | 2025-03 | — | 4B | No | No | No | No | No | No |
| GLM-4 32B | 2025-03 | 128k | 32B | No | No | No | No | Yes | No |
| GLM-4.7 | 2025-01 | 128k | 358B (32B active) | No | No | No | No | Yes | No |
| GLM-4.5 | 2025-01 | 128k | 355B (32B active) | No | No | No | No | Yes | No |
| GLM-4.7 Flash | 2025-01 | 198k | 30B (3B active) | No | No | No | No | Yes | No |
| GLM-4.5-Air | 2025-01 | 128k | 106B (12B active) | No | No | No | No | Yes | No |
| GLM-4.6 | 2025-01 | 198k | 355B (32B active) | No | No | No | No | Yes | No |
| GLM-4-Extreme | 2024-06 | — | — | No | No | No | No | No | No |
| GLM-4-Air | 2024-06 | 128k | — | No | No | No | No | No | No |
| GLM-4-Flash | 2024-06 | 128k | — | No | No | No | No | No | No |
| GLM-4 9B | 2024-06 | 131k | 9B | No | No | No | No | No | No |
| GLM-4V 9B | 2024-06 | 131k | 9B | No | Yes | No | No | No | No |
Available From(10 providers)
Pricing
Frequently Asked Questions
- What is GLM-4 used for?
- GLM-4 is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does GLM-4 compare to ChatGLM-4?
- GLM-4 by Tsinghua Knowledge Engineering Group (THUDM) is strongest where you need vision and multimodal work, while ChatGLM-4 by Tsinghua Knowledge Engineering Group (THUDM) is the closest related family to check for adjacent model selection. GLM-4 has 16 listed variants and reaches up to 200k context, while ChatGLM-4 reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Models(16)
GLM 4.7
GLM 4.6V
GLM 4.5V
GLM-4 Code 9B
GLM-4 Air 4B
GLM-4 32B
GLM-4.7
GLM-4.5
GLM-4.7 Flash
GLM-4.5-Air
GLM-4.6
GLM-4-Extreme
GLM-4-Air
GLM-4-Flash
GLM-4 9B
GLM-4V 9B





