LLM ReferenceLLM Reference

GLM-Z1 Models by Tsinghua Knowledge Engineering Group (THUDM)

3 models2025Up to 128K ctxFrom $0.2/1M input

About

GLM-Z1 is a family of 3 AI models by Tsinghua Knowledge Engineering Group (THUDM), released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

3 in view
GLM-Z1 9BCurrent

Use when the workload needs 9B parameters.

2025-019B parameters

Use when the workload needs reasoning, 128K context, and 32B parameters.

2025-01reasoning128K context32B parameters
GLM Z1 32BCurrent

Use when the workload needs 128K context and 32B parameters.

2025-01128K context32B parameters

Release Timeline

1 release group
2025-01
3 current
GLM Z1 32B
128K context32B parameters
Current
GLM Z1 Rumination 32B
reasoning128K context32B parameters
Current
GLM-Z1 9B
9B parameters
Current

Specifications(3 models)

GLM-Z1 model specifications comparison
ModelReleasedContextParametersReasoning
GLM-Z1 9B2025-019BNo
GLM Z1 Rumination 32B2025-01128K32BYes
GLM Z1 32B2025-01128K32BNo

Available From(1 provider)

Pricing

GLM-Z1 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
GLM-Z1 9BFireworks AI$0.2$0.2Serverless
GLM Z1 Rumination 32BFireworks AI$0.9$0.9Serverless
GLM Z1 32BFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is GLM-Z1 used for?
GLM-Z1 is used for reasoning. The family description and listed model capabilities point to those workloads as the best fit.
How does GLM-Z1 compare to ChatGLM-4?
GLM-Z1 by Tsinghua Knowledge Engineering Group (THUDM) is strongest where you need reasoning, while ChatGLM-4 by Tsinghua Knowledge Engineering Group (THUDM) is the closest related family to check for adjacent model selection. GLM-Z1 has 3 listed variants and reaches up to 128K context, so compare the specs and pricing tables before choosing a production model.
Which GLM-Z1 model should I use?
For the lowest listed input price, start with GLM-Z1 9B through Fireworks AI at $0.2/1M input tokens. For the most capable/latest local choice, evaluate GLM Z1 Rumination 32B with 128K context and reasoning.

Models(3)