LLM Reference

Nemotron-Labs TwoTower Models by NVIDIA AI

NVIDIA AINVIDIA Open ModelOpen weightsOpen Source
1 model2026Up to 128k ctx

Details

ResearcherNVIDIA AI
Commercial useCommercial use: permitted
Models1
Released2026
Max context128k

About

Nemotron-Labs TwoTower is NVIDIA's open-weight block-wise autoregressive diffusion language model family that adapts a Nemotron-3-Nano-30B-A3B autoregressive backbone into a two-tower architecture with a frozen AR/context tower and a trainable diffusion/denoiser tower.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view

Use when the workload needs 128k context and open source.

2026-06128k contextopen source

Release Timeline

1 release group
2026-06
1 current
Current

Specifications(1 models)

Nemotron-Labs TwoTower model specifications comparison
ModelReleasedContextParameters
Nemotron-Labs TwoTower 30B-A3B Base2026-06128k~60B total checkpoint; Hugging Face reports 63B params

Frequently Asked Questions

What is Nemotron-Labs TwoTower used for?
Nemotron-Labs TwoTower is NVIDIA's open-weight block-wise autoregressive diffusion language model family that adapts a Nemotron-3-Nano-30B-A3B autoregressive backbone into a two-tower architecture with a frozen AR/context tower and a trainable diffusion/denoiser tower.
How does Nemotron-Labs TwoTower compare to NVIDIA Nemotron Nano 12B v2 VL?
Nemotron-Labs TwoTower by NVIDIA AI is strongest where you need its listed use cases, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron-Labs TwoTower has 1 listed variant and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which Nemotron-Labs TwoTower model should I use?
If price is the main constraint, use the pricing table first because Nemotron-Labs TwoTower does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Nemotron-Labs TwoTower 30B-A3B Base with 128k context.