Last refreshed 2026-05-01. Next refresh: weekly.
Why use Nemotron Mini 4B Instruct on NVIDIA NIM?
NVIDIA NIM offers Nemotron Mini 4B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: nvidia/nemotron-mini-4b-instructnvidia/nemotron-mini-4b-instructRequest example
nvidia/nemotron-mini-4b-instruct.Gotchas
- Use provider model ID "nvidia/nemotron-mini-4b-instruct", not the LLMReference slug "nemotron-mini-4b-instruct".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
Capabilities
No model capability flags are currently sourced.
About Nemotron Mini 4B Instruct
NVIDIA Nemotron Mini 4B Instruct, a small instruction-tuned model for chat applications.
FAQ
What is the context window for Nemotron Mini 4B Instruct on NVIDIA NIM?
Nemotron Mini 4B Instruct supports a 4k token context window on NVIDIA NIM.
What API model ID do I use for Nemotron Mini 4B Instruct on NVIDIA NIM?
Use the model ID nvidia/nemotron-mini-4b-instruct when calling NVIDIA NIM's API.
Who created Nemotron Mini 4B Instruct?
Nemotron Mini 4B Instruct was created by NVIDIA AI as part of the Nemotron-4 model family.
Is Nemotron Mini 4B Instruct open source?
Nemotron Mini 4B Instruct has open weights under NVIDIA Open Model according to the seed data, but that does not necessarily mean an OSI-approved open-source license.