LLM Reference
NVIDIA NIM

Granite 8B Code on NVIDIA NIM

Granite Code · IBM Research

Provisioned

Last refreshed 2026-05-19. Next refresh: weekly.

Why use Granite 8B Code on NVIDIA NIM?

NVIDIA NIM offers Granite 8B Code with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Compare Granite 8B Code across 3 providers to find the best fit for your use case
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: granite-8b-code
Model ID
granite-8b-code

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID granite-8b-code.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare Granite 8B Code Across Providers

ProviderInput (per 1M)Output (per 1M)
NVIDIA NIM
RHEL AI
IBM watsonx$0.60$0.60

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

No model capability flags are currently sourced.

About Granite 8B Code

Granite 8B Code is IBM Research's Granite Code model. It is deprecated (originally released 2024-05-06); use it only for reproducing earlier results or evaluating drift over time.

FAQ

What is the context window for Granite 8B Code on NVIDIA NIM?

Granite 8B Code supports a 8,000 token context window on NVIDIA NIM.

How does NVIDIA NIM compare to other Granite 8B Code providers?

Granite 8B Code is available from 3 providers. The cheapest input pricing is $0.6/1M tokens from IBM watsonx.

Who created Granite 8B Code?

Granite 8B Code was created by IBM Research as part of the Granite Code model family.

Is Granite 8B Code open source?

Granite 8B Code's open source status is unknown in the seed data.

Get Started

Model Specs

Released2024-05-06
Parameters8B
Context8K
ArchitectureDecoder Only

Related Models on NVIDIA NIM