LLM Reference

GLM 4.7 on Novita AI

GLM-4 · Tsinghua Knowledge Engineering Group (THUDM)

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use GLM 4.7 on Novita AI?

Novita AI offers GLM 4.7 with pay-as-you-go pricing at $0.60/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Compare GLM 4.7 across 3 providers to find the best fit for your use case
Input / 1M
$0.60
Output / 1M
$2.20
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: glm-4.7
Model ID
glm-4.7

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare GLM 4.7 Across Providers

ProviderInput (per 1M)Output (per 1M)
Fireworks AI$0.60$2.20
Vercel AI Gateway$2.25$2.75
Novita AI$0.60$2.20

Pricing

TypePrice (per 1M)
Input tokens$0.60
Output tokens$2.20

Capabilities

Function CallingTool UseStructured OutputsCode Execution

About GLM 4.7

GLM-4.7 is Z.ai's flagship text model featuring enhanced programming capabilities and deeper reasoning at 200K context, succeeding GLM-4.6.

FAQ

What does GLM 4.7 cost on Novita AI?

On Novita AI, GLM 4.7 costs $0.6 per 1M input tokens and $2.2 per 1M output tokens.

What is the context window for GLM 4.7 on Novita AI?

GLM 4.7 supports a 204,800 token context window on Novita AI.

How does Novita AI compare to other GLM 4.7 providers?

GLM 4.7 is available from 3 providers. The cheapest input pricing is $0.60/1M tokens from Fireworks AI.

What API model ID do I use for GLM 4.7 on Novita AI?

Use the model ID glm-4.7 when calling Novita AI's API.

Who created GLM 4.7?

GLM 4.7 was created by Tsinghua Knowledge Engineering Group (THUDM) as part of the GLM-4 model family.

Is GLM 4.7 open source?

GLM 4.7 is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2026-03-01
Parameters358B (32B active)
Context200K
ArchitectureDecoder Only