LLaVA 13B
llava-13b
Last refreshed 2026-04-19. Next refresh: weekly.
LLaVA 13B is worth evaluating for vision when its provider route and context window match the workload.
Decision context: Vision task fit, 1 tracked provider route, and research from 2026-01-01.
Use it for
- Teams evaluating vision
- Workloads that can use a 4K context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows
Cheapest output
-
Replicate API per 1M tokens
Provider routes
1
Tracked API hosts
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-01-01
Researched 137d ago
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Replicate API | - | - | ServerlessPartial |
Benchmark peer barsfor Vision
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
Original LLaVA (Large Language-and-Vision Assistant) 13B model. Multimodal vision+language model combining a vision encoder with a language model for visual understanding tasks.
LLaVA 13B has a 4K-token context window.