LLM Reference

NVIDIA Llama 3 ChatQA Models by NVIDIA AI

2 models2024Up to 8k ctxFrom $0.37/1M input

About

The NVIDIA Llama 3 ChatQA family of large language models (LLMs) is designed to excel in conversational question answering (QA) and retrieval-augmented generation (RAG). These models are grounded in the Llama 3 base model and leverage an enhanced training methodology from the ChatQA project. A standout feature is their integration of extensive conversational QA data, which enhances their capability to manage tabular data and complex arithmetic calculations. The family offers two primary variants: Llama3-ChatQA-1.5-8B and Llama3-ChatQA-1.5-70B. These variants cater to different performance needs and computational requirements, with the 70B model excelling in reasoning and language understanding. NVIDIA supports these models with comprehensive resources, including benchmark results and detailed documentation, for developers and researchers 14.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view1 retired

Use when the workload needs 8k context and 8B parameters.

2024-088k context8B parameters

Release Timeline

1 release group
2024-08
1 current · 1 retired
NVIDIA Llama 3 ChatQA 70B
8k context70B parameters
Archived
NVIDIA Llama 3 ChatQA 8B
8k context8B parameters
Current

Specifications(2 models)

NVIDIA Llama 3 ChatQA model specifications comparison
ModelReleasedContextParameters
NVIDIA Llama 3 ChatQA 8B2024-088k8B

Available From(2 providers)

Pricing

NVIDIA Llama 3 ChatQA model pricing by provider
ModelProviderInput / 1MOutput / 1MType
NVIDIA Llama 3 ChatQA 8BMicrosoft Foundry$0.37$1.1Provisioned

Frequently Asked Questions

What is NVIDIA Llama 3 ChatQA used for?
The NVIDIA Llama 3 ChatQA family of large language models (LLMs) is designed to excel in conversational question answering (QA) and retrieval-augmented generation (RAG).
How does NVIDIA Llama 3 ChatQA compare to NVIDIA Nemotron Nano 12B v2 VL?
NVIDIA Llama 3 ChatQA by NVIDIA AI is strongest where you need its listed use cases, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. NVIDIA Llama 3 ChatQA has 2 listed variants and reaches up to 8k context, so compare the specs and pricing tables before choosing a production model.
Which NVIDIA Llama 3 ChatQA model should I use?
For the lowest listed input price, start with NVIDIA Llama 3 ChatQA 8B through Microsoft Foundry at $0.37/1M input tokens. For the most capable/latest local choice, evaluate NVIDIA Llama 3 ChatQA 8B with 8k context.

Models(2)