GLM Z1 Rumination 32B
GLM Z1 Rumination 32B is worth evaluating for long context when its provider route and context window match the workload.
Use it for
- Teams evaluating long context
- Workloads that can use a 128k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest of 1 route · Fireworks AI
About
GLM Z1 Rumination 32B is Tsinghua Knowledge Engineering Group (THUDM)'s GLM-Z1 model focused on step-by-step reasoning. It offers a 128K-token context window.
GLM Z1 Rumination 32B is an open-source model in the GLM-Z1 family. The structured metadata tracks a 128k-token context window and reasoning. This page tracks provider routes through Fireworks AI, with the cheapest tracked route listed at $0.9 input and $0.9 output per 1M tokens. No headline benchmark score is tracked for GLM Z1 Rumination 32B yet.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.900 | $0.900 | Serverless |
Available via routers & gateways(1)
Capabilities
Benchmark peer barsfor Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of GLM Z1 Rumination 32B?
GLM Z1 Rumination 32B has a context window of 128k tokens.
How much does GLM Z1 Rumination 32B cost?
GLM Z1 Rumination 32B is available at $0.9/1M input tokens through Fireworks AI.
When was GLM Z1 Rumination 32B released?
GLM Z1 Rumination 32B was released on 2025-01-01.
Which providers offer GLM Z1 Rumination 32B?
GLM Z1 Rumination 32B is available from 1 provider: Fireworks AI.
Cheapest of 1 route · Fireworks AI