GLM-4V 9B
GLM-4V 9B is worth evaluating for long context and vision when its provider route and context window match the workload.
Use it for
- Teams evaluating long context and vision
- Workloads that can use a 131k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows
- Family
- GLM-4
- Released
- 2024-06-05
- Context
- 131k
- Parameters
- 9B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
Cheapest of 1 route · Replicate API
About
GLM-4V 9B is Tsinghua Knowledge Engineering Group (THUDM)'s GLM-4 model with multimodal text and image input. It offers a 128K-token context window and scores 48.3 on MMMU.
GLM-4V 9B is a model in the GLM-4 family. The structured metadata tracks a 131k-token context window and multimodal input. This page tracks provider routes through Replicate API, with the cheapest tracked route listed at $0.05 input and $0.25 output per 1M tokens. Headline tracked benchmarks include Massive Multi-discipline Multimodal Understanding 48.3.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Vision
Q/$ A1 relevant benchmark in the decision map.
Provider price ladder
Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Replicate API | $0.050 | $0.250 | Serverless |
Capabilities
Benchmark peer barsfor Vision
Benchmark scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multi-discipline Multimodal Understanding | 48.3 | — | https://mmmu-benchmark.github.io/ |
Migration checks
No linked migration route is available for this model yet.