Nemotron-Labs TwoTower Models by NVIDIA AI
1 model2026Up to 128k ctx
Details
ResearcherNVIDIA AI
LicenseNVIDIA Open Model
Commercial useCommercial use: permitted
Models1
Released2026
Max context128k
About
Nemotron-Labs TwoTower is NVIDIA's open-weight block-wise autoregressive diffusion language model family that adapts a Nemotron-3-Nano-30B-A3B autoregressive backbone into a two-tower architecture with a frozen AR/context tower and a trainable diffusion/denoiser tower.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
1 in view
Use when the workload needs 128k context and open source.
2026-06128k contextopen source
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Nemotron-Labs TwoTower 30B-A3B Base | Use when the workload needs 128k context and open source. | 2026-06 | 128k contextopen source | Current |
Release Timeline
1 release group2026-06
1 current
Nemotron-Labs TwoTower 30B-A3B Base
Current128k contextopen source
Specifications(1 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Nemotron-Labs TwoTower 30B-A3B Base | 2026-06 | 128k | ~60B total checkpoint; Hugging Face reports 63B params |
Frequently Asked Questions
- What is Nemotron-Labs TwoTower used for?
- Nemotron-Labs TwoTower is NVIDIA's open-weight block-wise autoregressive diffusion language model family that adapts a Nemotron-3-Nano-30B-A3B autoregressive backbone into a two-tower architecture with a frozen AR/context tower and a trainable diffusion/denoiser tower.
- How does Nemotron-Labs TwoTower compare to NVIDIA Nemotron Nano 12B v2 VL?
- Nemotron-Labs TwoTower by NVIDIA AI is strongest where you need its listed use cases, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron-Labs TwoTower has 1 listed variant and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
- Which Nemotron-Labs TwoTower model should I use?
- If price is the main constraint, use the pricing table first because Nemotron-Labs TwoTower does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Nemotron-Labs TwoTower 30B-A3B Base with 128k context.


