Last refreshed 2026-05-01. Next refresh: weekly.
Why use Granite 3.3 8B Instruct on NVIDIA NIM?
NVIDIA NIM offers Granite 3.3 8B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Compare Granite 3.3 8B Instruct across 2 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: ibm/granite-3.3-8b-instructibm/granite-3.3-8b-instructRequest example
ibm/granite-3.3-8b-instruct.Gotchas
- Use provider model ID "ibm/granite-3.3-8b-instruct", not the LLMReference slug "granite-3.3-8b-instruct".
Compare Granite 3.3 8B Instruct Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| NVIDIA NIM | — | — |
| Replicate API | $0.03 | $0.25 |
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
About Granite 3.3 8B Instruct
IBM Granite 3.3 8B with improved reasoning capabilities. Part of IBM's enterprise-focused Granite model family optimized for instruction following.
FAQ
What is the context window for Granite 3.3 8B Instruct on NVIDIA NIM?
Granite 3.3 8B Instruct supports a 128,000 token context window on NVIDIA NIM.
How does NVIDIA NIM compare to other Granite 3.3 8B Instruct providers?
Granite 3.3 8B Instruct is available from 2 providers. The cheapest input pricing is $0.03/1M tokens from Replicate API.
What API model ID do I use for Granite 3.3 8B Instruct on NVIDIA NIM?
Use the model ID ibm/granite-3.3-8b-instruct when calling NVIDIA NIM's API.
Who created Granite 3.3 8B Instruct?
Granite 3.3 8B Instruct was created by IBM Research as part of the Granite 3 model family.
Is Granite 3.3 8B Instruct open source?
Granite 3.3 8B Instruct is open source according to the seed data.