LLM Reference

NeVA Models by NVIDIA AI

NVIDIA AICC-BY-NC-4.0
3 models2024

About

NeVA is a family of multimodal vision-language models developed by NVIDIA within the NeMo Multimodal ecosystem. These models integrate large language models, such as NVGPT or LLaMA, with a vision encoder to interpret and generate human-like responses to both text and images. The NeVA models undergo a two-stage training process involving feature alignment pre-training and end-to-end fine-tuning. They are ideal for tasks that require an advanced understanding of visual content and precise instruction following, showcasing capabilities akin to advanced multimodal models like GPT-4, even when presented with novel inputs. A specific variant, Video NeVA, expands its functionality to include video data processing by converting videos into sequences of image frames. Available in versions with 8B, 22B, and 43B parameters, NeVA models make use of NeMo's framework features for efficient training, including model parallelism and activation checkpointing. It is noteworthy that some NeVA models are restricted to non-commercial use 12.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view1 retired
NeVA 43BCurrent

Use when the workload needs 43B parameters.

2024-0343B parameters
NeVA 8BCurrent

Use when the workload needs 8B parameters.

2024-038B parameters

Release Timeline

1 release group
2024-03
2 current · 1 retired
NeVA 22B
22B parameters
Archived
NeVA 43B
43B parameters
Current
NeVA 8B
8B parameters
Current

Specifications(3 models)

NeVA model specifications comparison
ModelReleasedParameters
NeVA 43B2024-0343B
NeVA 8B2024-038B

Available From(1 provider)

Frequently Asked Questions

What is NeVA used for?
NeVA is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
How does NeVA compare to NVIDIA Nemotron Nano 12B v2 VL?
NeVA by NVIDIA AI is strongest where you need coding, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. NeVA has 3 listed variants, so compare the specs and pricing tables before choosing a production model.
Which NeVA model should I use?
If price is the main constraint, use the pricing table first because NeVA does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate NeVA 43B.

Models(3)