LLaVA 1.6 Hermes Yi 34B

Name: LLaVA 1.6 Hermes Yi 34B
Author: Haotian Liu

Released

2024-01-31

Last refreshed

2026-05-01

Status

Researched 198d ago

DeprecatedOpen sourceCommercial use: permittedLong context

LLaVA 1.6 Hermes Yi 34B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

Teams maintaining an existing integration
Workloads that can use a 200k context window
Buyers comparing 2 tracked provider routes

Do not use it for

New production launches
Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: LLaVA 1.6
Released: 2024-01-31
Context: 200k
Parameters: 34B
Architecture: Decoder Only
Knowledge cutoff: 2024-03
Specialization: general
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

Haotian Liu

Academic researcher focused on vision models

N/A

Founded N/A

Website

Pricing

Output / 1M

$0.900

Input / 1M

$0.900

Cheapest of 2 routes · Fireworks AI

Providers(2)

NVIDIA NIM Fireworks AI

View 2 provider routes

About

LLaVA-1.6, specifically the Hermes Yi 34B variant, represents a leap in multimodal AI capabilities, enhanced from its predecessor, LLaVA 1.5. This open-source chatbot excels in processing and responding to both text and image inputs. The model boasts a fourfold increase in image resolution support, enhanced visual reasoning and OCR capabilities, and improved visual conversation and world knowledge. It leverages the Nous-Hermes-2-Yi-34B language model as its backbone, offering superior commercial licenses and bilingual support. LLaVA-1.6-34B outshines other open-source models and even competes with Google's Gemini Pro on some tasks. Its training efficiency is impressive, requiring just one day on 32 A100 GPUs, and a demo for chat, image captioning, and visual question answering is accessible online.

LLaVA 1.6 Hermes Yi 34B is an open-source model in the LLaVA 1.6 family. The structured metadata tracks a 200k-token context window. This page tracks provider routes through NVIDIA NIM and Fireworks AI, with the cheapest tracked route listed at $0.9 input and $0.9 output per 1M tokens. No headline benchmark score is tracked for LLaVA 1.6 Hermes Yi 34B yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Fireworks AI	$0.900	$0.900	Provisioned
NVIDIA NIM	-	-	ProvisionedPartial

Available via routers & gateways(2)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughFireworks AI

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.

Free OSSNVIDIA NIM