Last refreshed 2026-05-01. Next refresh: weekly.
Why use Llama 3.1 Nemotron Nano VL 8B v1 on NVIDIA NIM?
NVIDIA NIM offers Llama 3.1 Nemotron Nano VL 8B v1 with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Docs fallbackInstall
Use the provider REST API or SDKAuth
Create a provider API keyCall
model: nvidia/llama-3.1-nemotron-nano-vl-8b-v1Model ID
nvidia/llama-3.1-nemotron-nano-vl-8b-v1Request example
Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID
nvidia/llama-3.1-nemotron-nano-vl-8b-v1.Gotchas
- Use provider model ID "nvidia/llama-3.1-nemotron-nano-vl-8b-v1", not the LLMReference slug "llama-3.1-nemotron-nano-vl-8b-v1".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
VisionMultimodal
About Llama 3.1 Nemotron Nano VL 8B v1
Vision-language variant of NVIDIA Nemotron Nano 8B with multimodal capabilities.
FAQ
What is the context window for Llama 3.1 Nemotron Nano VL 8B v1 on NVIDIA NIM?
Llama 3.1 Nemotron Nano VL 8B v1 supports a 4,000 token context window on NVIDIA NIM.
What API model ID do I use for Llama 3.1 Nemotron Nano VL 8B v1 on NVIDIA NIM?
Use the model ID nvidia/llama-3.1-nemotron-nano-vl-8b-v1 when calling NVIDIA NIM's API.
Who created Llama 3.1 Nemotron Nano VL 8B v1?
Llama 3.1 Nemotron Nano VL 8B v1 was created by NVIDIA AI as part of the Nemotron Nano 2 model family.
Is Llama 3.1 Nemotron Nano VL 8B v1 open source?
Llama 3.1 Nemotron Nano VL 8B v1 is open source according to the seed data.
Model Specs
Released2025-03-01
Parameters8B
Context4K
ArchitectureDecoder Only