LLM ReferenceLLM Reference
NVIDIA NIM

ChatGLM3 6B on NVIDIA NIM

ChatGLM3 · Tsinghua Knowledge Engineering Group (THUDM)

Provisioned

Last refreshed 2026-05-01. Next refresh: weekly.

Why use ChatGLM3 6B on NVIDIA NIM?

NVIDIA NIM offers ChatGLM3 6B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: chatglm3-6b
Model ID
chatglm3-6b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID chatglm3-6b.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

No model capability flags are currently sourced.

About ChatGLM3 6B

ChatGLM3-6B is an open-source large language model created by Zhipu AI and Tsinghua University, designed for a variety of natural language processing tasks. Operating with a transformer-based architecture and fewer than 10 billion parameters, it excels in multi-turn dialogue, function invocation, long-form text understanding, and supports both Chinese and English. Additionally, its open-source nature fosters innovation and collaboration in the AI community, while its robust performance surpasses many models on tasks like semantics and coding.

FAQ

What is the context window for ChatGLM3 6B on NVIDIA NIM?

ChatGLM3 6B supports a 8,000 token context window on NVIDIA NIM.

Who created ChatGLM3 6B?

ChatGLM3 6B was created by Tsinghua Knowledge Engineering Group (THUDM) as part of the ChatGLM3 model family.

Is ChatGLM3 6B open source?

ChatGLM3 6B's open source status is unknown in the seed data.

Get Started

Model Specs

Released2024-01-30
Parameters6B
Context8K
ArchitectureDecoder Only