NVIDIA AI

53 models across 15 families · Latest: Nemotron-Labs TwoTower 30B-A3B Base (2026-06)

Researched 16d ago

Accelerated AI for enterprise solutions

CodingRAGAgentsLong contextVisionClassificationJSON / Tool useHighlight

NVIDIA AI's portfolio covers 50 active models across 14 current families, spanning coding, rag, and agents. Open a model detail page to compare provider routes and sourced benchmarks.
Covers 7 workload areas across 50 active tracked models; last verified 2026-07-02.

Use it for

Teams evaluating coding, rag, and agents across this lab's releases
Comparing model families before committing to a flagship
Migration and pricing follow-ups across 50 tracked models

Do not use it for

Choosing a hosting provider without opening a model page for price ladders

Active models

Current models from this lab, excluding deprecated ones

Active families

Current model families from this lab

Open catalog

45 open

0 open source / 45 open weights

Lowest output price

$0.160 /1M

Cheapest tracked output across active models, per 1M tokens

Latest dated release

2026-06-25

Nemotron-Labs TwoTower 30B-A3B Base

Freshness

2026-07-02

Researched 16d ago

fresh

Information

Founded2015

Santa Clara, California, United States

Links

Website GitHub X / Twitter LinkedIn HuggingFace Crunchbase

Release cadence

Showing 5 recent dated releases (full timeline below). Latest: Nemotron-Labs TwoTower 30B-A3B Base (2026-06-25).

Where this lab wins

Coding: 4 tracked models with SWE-bench / HumanEval-style scores.
RAG: 2 tracked models with ruler / needle retrieval benchmarks.
Agentic: 4 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
Long-context: 14 tracked models with context-token or InfiniteBench-class signal.

Flagship quality / price signal

Flagship: Nemotron 3 Super-120B-A12B (best sourced coding quality-per-dollar in this portfolio).

Coding task grade B · swe-bench-verified score 60.47 · cheapest tracked output $0.450 per 1M tokens

NVIDIA AI is an American AI research organization founded in 2015. Accelerated AI for enterprise solutions. NVIDIA AI ships 15 model families totaling 53 models, with the most recent release Nemotron-Labs TwoTower 30B-A3B Base in 2026-06. Notable families include Nemotron-Labs TwoTower, Nemotron 3, and Cosmos 3. Use it as a stable reference for lab background, release coverage, and follow-up model pages as. View official API endpoints, benchmark performance, and coding/agent fit for every NVIDIA AI model.

About

NVIDIA's journey into the realm of artificial intelligence, specifically in the areas of generative AI and large language models (LLMs), is marked by a series of strategic innovations that have equipped the company to lead in the AI landscape. Originally renowned for its high-quality graphics processing units (GPUs) used in gaming and multimedia, NVIDIA shifted gears to address the demands of AI, focusing on quality over sheer volume. This strategic pivot, underscored by the development of pioneering technologies, has attracted tech giants such as Microsoft, Amazon, and Facebook as customers, and has become instrumental in powering large-scale AI applications. NVIDIA's GPUs play a pivotal role in supporting the infrastructure required for complex AI tasks, as evidenced by Microsoft's significant expenditure on NVIDIA's chips 3. The company's engagement with AI began in earnest in the 2010s, laying the foundation for its success in generative AI. The launch of specialized GPUs, like the Tesla series and the introduction of significant architectures like Kepler, marked its early contributions. Additionally, the introduction of CUDA (Compute Unified Device Architecture) in 2006 was a landmark development that unlocked the parallel processing potential of GPUs, extending their applicability beyond traditional graphics tasks to include a wide array of AI applications 9. In 2014, NVIDIA further entrenched its position by introducing the cuDNN (CUDA Deep Neural Network) library, optimizing codes for deep learning models and significantly enhancing training and inference processes 9. Beyond hardware, NVIDIA's commitment to the broader AI ecosystem is evident through initiatives like the NVIDIA Deep Learning Institute (NDLI) and the integration of open-source frameworks. These efforts have cultivated a thriving developer community and accelerated the adoption of NVIDIA's technologies across diverse sectors. The development of the NeMo framework exemplifies this approach; NeMo offers a comprehensive platform for creating custom generative AI, including LLMs and vision language models (VLMs), streamlining the training and deployment of LLMs and making them accessible for global enterprises 1012. NVIDIA's impact on generative AI and LLMs is profound. Its technologies facilitate a plethora of applications from AI-driven video and image generation to the deployment of large language models and recommendation systems 48. Platforms like NVIDIA Omniverse showcase the company’s dedication to pushing AI’s boundaries, particularly in supporting diverse AI applications across industries. Ongoing advancements, such as those in the NeMo framework that offer significant speed enhancements for training LLMs, underscore NVIDIA's constant push for innovation. NVIDIA's remarkable success is a testament to its strategic foresight, technical expertise, and unwavering commitment to fostering a vibrant AI ecosystem.

Featured models

Model	Released	Context	Input price ($/1M)	Output price ($/1M)	License	Openness
Nemotron-Labs TwoTower 30B-A3B Base	2026-06-25	128k	-	-	NVIDIA Open Model	Open weights
Nemotron 3 Ultra	2026-06-04	1m	$0.5	$2.2	NVIDIA Open Model	Open weights
Cosmos 3 Nano	2026-05-31	256k	-	-	OpenMDW 1.1	Open weights

Model families

Nemotron-Labs TwoTower

Nemotron 3

Cosmos 3

Nemotron-Labs-Diffusion

Nemotron-Cascade

NVIDIA Nemotron Nano 12B v2 VL

Nemotron Nano 2

Jet-Nemotron

NV-Embed

Nemotron-4

NemoGuard

NVLM

NVIDIA Llama 3 ChatQA

NeVA

Recent releases

Nemotron-Labs TwoTower 30B-A3B Base- 2026-06-25
Nemotron 3 Ultra- 2026-06-04
Cosmos 3 Nano- 2026-05-31
Cosmos 3 Super- 2026-05-31
Cosmos 3 Super Text2Image- 2026-05-31

Top comparisons

FAQ

Who founded NVIDIA AI and when?

NVIDIA AI was founded in 2015 and is associated with Santa Clara, California, United States.

What models has NVIDIA AI released?

NVIDIA AI ships 53 models across 15 families: Nemotron-Labs TwoTower, Nemotron 3, and Cosmos 3.

Is NVIDIA AI's technology open source?

Some NVIDIA AI models are open-weight (Nemotron-Labs TwoTower 30B-A3B Base, Nemotron 3 Ultra, and Nemotron 3 Nano Omni); others are proprietary (NVIDIA Nemotron Nano 12B v2 VL BF16, NVIDIA Nemotron Nano 9B v2, and Megatron GPT 20B).

Where is NVIDIA AI headquartered?

NVIDIA AI is headquartered in Santa Clara, California, United States.

What is NVIDIA AI known for?

Accelerated AI for enterprise solutions. Its most prominent tracked family is Nemotron-Labs TwoTower.

How can I access NVIDIA AI's models?

NVIDIA AI's models are available via AWS Bedrock, Cloudflare Workers AI, DeepInfra, Fireworks AI, and Microsoft Foundry.

Explore related pages

Nemotron-Labs TwoTower model family Nemotron 3 model family Cosmos 3 model family Nemotron-Labs-Diffusion model family Nemotron-Labs TwoTower 30B-A3B Base model spec Nemotron 3 Ultra model spec Cosmos 3 Nano model spec OpenAI Anthropic AI at Meta

Last reviewed: 2026-07-02. Data sourced from public lab announcements and provider documentation.