LLM Reference

Nemotron 3 8B

About

Nemotron-3 8B is a family of large language models from NVIDIA aimed at enterprises building custom LLMs. Built on a GPT-style decoder-only transformer architecture, the base model has 8 billion parameters and supports a 4,096-token context length. It serves as the foundation for several specialized variants: Nemotron-3-8B-Base-4k for customization, the Nemotron-3-8B-Chat models, which produce steerable outputs and are refined via RLHF, and Nemotron-3-8B-QA, which is optimized for question answering. The models integrate with the NVIDIA NeMo framework, support parameter-efficient fine-tuning methods such as LoRA, and are designed for efficient deployment on NVIDIA GPUs. They were trained on 3.5 to 3.8 trillion tokens of multilingual data spanning a diverse range of languages and have been evaluated on standard benchmarks, although they may still exhibit biases and inaccuracies inherited from their training data.
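The LoRA fine-tuning method mentioned above can be illustrated with a minimal numpy sketch. This is a toy illustration of the technique itself, not NeMo's actual API; all names and dimensions here are hypothetical. LoRA freezes the pretrained weight matrix and learns a low-rank update, so only a small fraction of parameters are trained.

```python
import numpy as np

# Minimal sketch of LoRA (Low-Rank Adaptation). Illustrative only --
# not the NeMo framework's actual fine-tuning interface.

rng = np.random.default_rng(0)

d_out, d_in, r = 16, 32, 4   # toy layer dimensions; r is the LoRA rank
alpha = 8                    # LoRA scaling hyperparameter

W = rng.standard_normal((d_out, d_in))   # frozen pretrained weight
A = rng.standard_normal((r, d_in))       # trainable low-rank factor
B = np.zeros((d_out, r))                 # zero-initialized, so the
                                         # adapter starts as a no-op

def lora_forward(x):
    # Effective weight is W + (alpha / r) * B @ A, applied without
    # ever materializing the full update matrix.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
# Before any training step, B is zero, so the adapted layer exactly
# matches the frozen base layer.
assert np.allclose(lora_forward(x), W @ x)
```

Only `A` and `B` (roughly `r * (d_in + d_out)` values) would be updated during fine-tuning, which is why LoRA is attractive for adapting an 8B-parameter model on modest hardware.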

Capabilities

Multimodal, Function Calling, Tool Use, JSON Mode

Providers (1)

Provider: Azure OpenAI
Input (per 1M tokens): —
Output (per 1M tokens): —
Type: Provisioned

Specifications

Parameters: 8B
Context: 4K
Architecture: Decoder-only
Specialization: General