LLM Reference

GLM-4 9B

Released: 2024-06-05
Last refreshed: 2026-05-19
Status: Researched 16d ago
Long context

GLM-4 9B is worth evaluating for long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating long context
  • Workloads that can use a 131k context window
  • Buyers comparing 4 tracked provider routes
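
Whether a workload "can use" the window comes down to simple token arithmetic. The sketch below is a minimal check, assuming that the 131k figure means 128 × 1,024 = 131,072 tokens and that prompt and completion share the same window. Both are assumptions, and the function names are hypothetical. A given provider route may enforce a lower limit.

```python
# Assumed exact size behind the "131k" figure: 128 * 1024 tokens.
CONTEXT_WINDOW_TOKENS = 128 * 1024  # 131,072


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """True if prompt plus requested completion fit in one window.

    Assumes input and output tokens count against the same limit.
    """
    return prompt_tokens + max_output_tokens <= window


def remaining_output_budget(prompt_tokens: int,
                            window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Tokens left for the completion, or 0 if the prompt alone overflows."""
    return max(0, window - prompt_tokens)
```

Token counts should come from the tokenizer the chosen route actually uses; character-based estimates can be off by a wide margin.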

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Family: GLM-4
Released: 2024-06-05
Context: 131k tokens
Parameters: 9B
Architecture: Decoder-only
Specialization: General
Training: Fine-tuned
Created by

Leading China's LLM innovation surge

Beijing, China
Founded 2018

Pricing

Input / 1M tokens: $0.100
Output / 1M tokens: $0.100

Cheapest of 4 routes: AWS Bedrock (GCP Vertex AI is tracked at the same price)

About

GLM-4 9B is a model in the GLM-4 family from Tsinghua Knowledge Engineering Group (THUDM). It offers a 128K-token context window, which the structured metadata tracks as 131k tokens; the two figures agree if 128K is read as 128 × 1,024 = 131,072 tokens.

This page tracks provider routes through Fireworks AI, Bitdeer AI, AWS Bedrock, and GCP Vertex AI. The cheapest tracked routes, AWS Bedrock and GCP Vertex AI, are listed at $0.100 input and $0.100 output per 1M tokens. No headline benchmark score is tracked for GLM-4 9B yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder


Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider        Input / 1M   Output / 1M   Route
AWS Bedrock     $0.100       $0.100        Serverless
GCP Vertex AI   $0.100       $0.100        Serverless
Fireworks AI    $0.200       $0.200        Serverless
Bitdeer AI      $0.140       $0.420        Serverless
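
Per-1M prices are easier to compare once they are applied to a concrete request shape. The sketch below is a minimal illustration that hard-codes the four tracked prices above and ranks the routes by the cost of a single request. The `Route` class and function names are hypothetical, and the calculation ignores batch discounts, cached reads, and any provider-specific fees.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# Prices copied from the provider price ladder on this page.
ROUTES = [
    Route("AWS Bedrock", 0.100, 0.100),
    Route("GCP Vertex AI", 0.100, 0.100),
    Route("Fireworks AI", 0.200, 0.200),
    Route("Bitdeer AI", 0.140, 0.420),
]


def request_cost(route: Route, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request on the given route."""
    total = input_tokens * route.input_per_m + output_tokens * route.output_per_m
    return total / 1_000_000


def rank_routes(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs, cheapest first; ties keep table order."""
    costs = [(r.provider, request_cost(r, input_tokens, output_tokens)) for r in ROUTES]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Example long-context request: 100,000 input tokens, 2,000 output tokens.
    for provider, cost in rank_routes(100_000, 2_000):
        print(f"{provider:<14} ${cost:.4f}")
```

For a request with 100,000 input tokens and 2,000 output tokens, this works out to about $0.0102 on AWS Bedrock and GCP Vertex AI, $0.0148 on Bitdeer AI, and $0.0204 on Fireworks AI. Bitdeer AI's higher output price weighs more heavily as the output share of a request grows.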

Capabilities

No model capability flags are currently sourced.

Benchmark peer bars for Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks (6)