LLM Reference

LLaVA 1.6 Hermes Yi 34B

Released
2024-01-31
Last refreshed
2026-05-01
Status
Researched 154d ago
DeprecatedLong context

LLaVA 1.6 Hermes Yi 34B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 200k context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • New production launches
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
LLaVA 1.6
Released
2024-01-31
Context
200k
Parameters
34B
Architecture
Decoder Only
Knowledge cutoff
2024-03
Specialization
general
Training
finetuned
Created by

Academic researcher focused on vision models

N/A
Founded N/A
Website
Pricing
Output / 1M
$0.900
Input / 1M
$0.900

Cheapest of 2 routes · Fireworks AI

About

LLaVA-1.6, specifically the Hermes Yi 34B variant, represents a leap in multimodal AI capabilities, enhanced from its predecessor, LLaVA 1.5. This open-source chatbot excels in processing and responding to both text and image inputs. The model boasts a fourfold increase in image resolution support, enhanced visual reasoning and OCR capabilities, and improved visual conversation and world knowledge. It leverages the Nous-Hermes-2-Yi-34B language model as its backbone, offering superior commercial licenses and bilingual support. LLaVA-1.6-34B outshines other open-source models and even competes with Google's Gemini Pro on some tasks. Its training efficiency is impressive, requiring just one day on 32 A100 GPUs, and a demo for chat, image captioning, and visual question answering is accessible online.

LLaVA 1.6 Hermes Yi 34B is a model in the LLaVA 1.6 family. The structured metadata tracks a 200k-token context window. This page tracks provider routes through NVIDIA NIM and Fireworks AI, with the cheapest tracked route listed at $0.9 input and $0.9 output per 1M tokens. No headline benchmark score is tracked for LLaVA 1.6 Hermes Yi 34B yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.900$0.900
Provisioned
NVIDIA NIM--
ProvisionedPartial

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.