NeVA 8B
NeVA 8B has model metadata but no tracked provider pricing, which keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: NeVA
- Released: 2024-03-01
- Parameters: 8B
- Architecture: Decoder Only
- Specialization: general
- Training: finetuned
About
NeVA is NVIDIA's version of LLaVA, a multimodal vision-language model that interprets and responds to inputs combining text and images. It is built on a transformer architecture and pairs a GPT language model, available in 8B, 22B, and 43B parameter versions, with a CLIP vision encoder (ViT-L/14). A projection layer connects the vision encoder's output to the language model, combining visual and textual information. Training has two stages: pretraining on image-caption pairs, then finetuning on synthetic instruction data, which lets the model handle complex multimodal prompts. It is suited to answering questions about images and to generating textual descriptions of visual content. The model is deployed with NVIDIA's Triton inference server and is trained with the NeMo LLM framework.
NeVA 8B is a model in the NeVA family. No headline benchmark score is tracked for NeVA 8B yet.
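As a rough illustration of the vision encoder, projection layer, and language model arrangement described above, the following toy sketch projects image patch vectors into the language model's embedding width and joins them with text token embeddings. It is not NVIDIA's implementation: the function name `project_patches`, the tiny dimensions, the random weights, and the choice to place image tokens before text tokens are all assumptions made for the example.

```python
import random

def project_patches(patch_embeddings, weight, bias):
    """Apply a linear projection to each patch vector.

    weight holds one row per output dimension; each row has one entry
    per input (vision) dimension. This stands in for the projection layer.
    """
    return [
        [sum(p * w for p, w in zip(patch, row)) + b for row, b in zip(weight, bias)]
        for patch in patch_embeddings
    ]

# Toy sizes chosen for readability (assumptions, not the real model widths).
VISION_DIM = 8    # width of the vision encoder's patch vectors
TEXT_DIM = 4      # width of the language model's token embeddings
NUM_PATCHES = 3
NUM_TEXT_TOKENS = 2

random.seed(0)

# Stand-ins for the vision encoder output and the text token embeddings.
patches = [[random.gauss(0, 1) for _ in range(VISION_DIM)] for _ in range(NUM_PATCHES)]
text_tokens = [[random.gauss(0, 1) for _ in range(TEXT_DIM)] for _ in range(NUM_TEXT_TOKENS)]

# Randomly initialised projection parameters (hypothetical values).
weight = [[random.gauss(0, 0.02) for _ in range(VISION_DIM)] for _ in range(TEXT_DIM)]
bias = [0.0] * TEXT_DIM

# Project image patches to the text width, then form one input sequence.
image_tokens = project_patches(patches, weight, bias)
sequence = image_tokens + text_tokens

print(len(sequence), "tokens, each of width", len(sequence[0]))
```

After projection, every element of `sequence` has the same width, which is what allows a single decoder to attend over image and text positions together.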
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.