LLM Reference
NVIDIA NIM

LLaVA 1.6 Hermes Yi 34B on NVIDIA NIM

LLaVA 1.6 · Haotian Liu

Provisioned

Pricing

TypePrice (per 1M)
Input tokensFree
Output tokensFree

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About LLaVA 1.6 Hermes Yi 34B

LLaVA-1.6, specifically the Hermes Yi 34B variant, represents a leap in multimodal AI capabilities, enhanced from its predecessor, LLaVA 1.5. This open-source chatbot excels in processing and responding to both text and image inputs. The model boasts a fourfold increase in image resolution support, enhanced visual reasoning and OCR capabilities, and improved visual conversation and world knowledge. It leverages the Nous-Hermes-2-Yi-34B language model as its backbone, offering superior commercial licenses and bilingual support. LLaVA-1.6-34B outshines other open-source models and even competes with Google's Gemini Pro on some tasks. Its training efficiency is impressive, requiring just one day on 32 A100 GPUs, and a demo for chat, image captioning, and visual question answering is accessible online.

Get Started

Model Specs

Released2024-01-31
Parameters34B
Context200K
ArchitectureDecoder Only
Knowledge cutoff2024-03

Related Models on NVIDIA NIM