Last refreshed 2026-05-19. Next refresh: weekly.
Why use Llama 3.1 Nemotron Nano 4B v1.1 on NVIDIA NIM?
NVIDIA NIM offers Llama 3.1 Nemotron Nano 4B v1.1 with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: nvidia/llama-3.1-nemotron-nano-4b-v1.1nvidia/llama-3.1-nemotron-nano-4b-v1.1Request example
nvidia/llama-3.1-nemotron-nano-4b-v1.1.Gotchas
- Use provider model ID "nvidia/llama-3.1-nemotron-nano-4b-v1.1", not the LLMReference slug "llama-3.1-nemotron-nano-4b-v1.1".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
Capabilities
No model capability flags are currently sourced.
About Llama 3.1 Nemotron Nano 4B v1.1
Llama 3.1 Nemotron Nano 4B v1.1 is NVIDIA AI's Nemotron Nano 2 model. It was released 2025-04-01.
FAQ
What is the context window for Llama 3.1 Nemotron Nano 4B v1.1 on NVIDIA NIM?
Llama 3.1 Nemotron Nano 4B v1.1 supports a 4k token context window on NVIDIA NIM.
What API model ID do I use for Llama 3.1 Nemotron Nano 4B v1.1 on NVIDIA NIM?
Use the model ID nvidia/llama-3.1-nemotron-nano-4b-v1.1 when calling NVIDIA NIM's API.
Who created Llama 3.1 Nemotron Nano 4B v1.1?
Llama 3.1 Nemotron Nano 4B v1.1 was created by NVIDIA AI as part of the Nemotron Nano 2 model family.
Is Llama 3.1 Nemotron Nano 4B v1.1 open source?
Llama 3.1 Nemotron Nano 4B v1.1 has open weights under Llama 3 Community according to the seed data, but that does not necessarily mean an OSI-approved open-source license.