LLM ReferenceLLM Reference

LLaVA 13B

llava-13b

Researched 137d ago

Last refreshed 2026-04-19. Next refresh: weekly.

Open SourceMultimodalVision

LLaVA 13B is worth evaluating for vision when its provider route and context window match the workload.

Decision context: Vision task fit, 1 tracked provider route, and research from 2026-01-01.

Use it for

  • Teams evaluating vision
  • Workloads that can use a 4K context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Strict JSON or tool-calling flows

Cheapest output

-

Replicate API per 1M tokens

Provider routes

1

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-01-01

Researched 137d ago

stale

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
Replicate API--
ServerlessPartial

Benchmark peer barsfor Vision

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Original LLaVA (Large Language-and-Vision Assistant) 13B model. Multimodal vision+language model combining a vision encoder with a language model for visual understanding tasks.

LLaVA 13B has a 4K-token context window.

Capabilities

VisionMultimodal

Rankings

Specifications

FamilyLLaVA
Released2023-04-17
Parameters13B
Context4K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuned

Created by

Academic researcher focused on vision models

N/A
Founded N/A
Website

Providers(1)