NVIDIA NIM Models — Pricing & Benchmarks
143 models available · NVIDIA
NVIDIA NIM hosts 143 AI models in this catalog. The lowest listed input price is Nemotron 3 Super-120B-A12B at $0.1/1M input tokens. LLM Reference lets you compare these models across all 80 providers without switching tabs.
Hosting moonshotai/kimi-k2.6? Browse Kimi K2.6 on NVIDIA NIM for GPU-hour pricing context or open the step-by-step NIM guide.
Where else to run this
Pricing Overview
About NVIDIA NIM
NIM packages inference runtimes and model profiles into containers that expose standard API surfaces such as chat completions, completions, model listing, tokenization, health, and management endpoints. The hosted API path is useful for prototyping and catalog discovery, while the NGC/container path is the self-hosted route for teams that want GPU-hour infrastructure control, private-network deployment, Kubernetes scaling, or NVIDIA AI Enterprise support. Per-token pricing is not a universal provider-level claim in the current seed data; pricing should stay attached to sourced model-provider rows or NVIDIA's current catalog terms.
Full provider profile →