LLaVA 13B on Replicate API

Brand: Haotian Liu
SKU: llava-13b-replicate

LLaVA · Haotian Liu

Serverless · Open Weights

Last refreshed 2026-04-19. Refresh cadence: weekly.

Why use LLaVA 13B on Replicate API?

Replicate is a cloud platform for running machine learning models through an API. It hosts LLaVA 13B as a serverless endpoint, so you can call the model from the Python client or over HTTP without provisioning your own GPUs.

Input / 1M

Output / 1M

Cache

Not sourced

Batch

Not sourced

Setup recipe

Python + curl

Install

pip install replicate

Auth

export REPLICATE_API_TOKEN=...

Call

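import replicate

# Replicate expects the full "owner/model-name" identifier; "llava-13b" is this listing's short ID.
output = replicate.run(
    "llava-13b",
    input={"prompt": "Hello"},
)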

Model ID

llava-13b

Request example

import replicate

# The client reads REPLICATE_API_TOKEN from the environment.
# Identifier format: "owner/model-name" (latest version) or "owner/model-name:version-hash".
output = replicate.run(
    "llava-13b",
    input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))
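
Because LLaVA is a vision model, a real request would normally carry an image alongside the prompt. The sketch below shows one way that might look. The "image" input field, the example URL in the usage note, and the helper names build_llava_input and describe_image are assumptions for illustration, not taken from this listing; check the model's input schema on Replicate before relying on them.

```python
def build_llava_input(prompt: str, image_url: str) -> dict:
    """Assemble the input payload; the "image" key is an assumed field name."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return {"prompt": prompt, "image": image_url}


def describe_image(model_ref: str, prompt: str, image_url: str) -> str:
    """Run the model and join the streamed text chunks into one string."""
    # Imported lazily so build_llava_input works without the client installed.
    import replicate

    output = replicate.run(model_ref, input=build_llava_input(prompt, image_url))
    return "".join(output)
```

A call would then look like describe_image("owner/model-name", "What is in this picture?", "https://example.com/cat.png"), with the placeholder identifier replaced by the full Replicate model reference.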

Gotchas

Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
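The examples read the API token from REPLICATE_API_TOKEN. If you store the token under a different variable name, make sure your application config maps that variable to the client; otherwise keep the default name.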
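
The identifier and endpoint rules above can be sketched in Python. This is a minimal illustration, not the Replicate client's own code: parse_model_ref, build_request, and auth_headers are hypothetical helper names. The base URL https://api.replicate.com, the Bearer authorization header, and the version-pinned route (/v1/predictions with a "version" field in the body) are assumptions based on Replicate's public HTTP API and are not stated in this listing.

```python
from __future__ import annotations

import os
from typing import Mapping, Optional

API_BASE = "https://api.replicate.com"  # assumed base URL


def parse_model_ref(ref: str) -> tuple[str, str, Optional[str]]:
    """Split "owner/model-name[:version]" into (owner, name, version)."""
    path, _, version = ref.partition(":")
    owner, sep, name = path.partition("/")
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f'expected "owner/model-name", got {ref!r}')
    return owner, name, version or None


def build_request(ref: str, model_input: dict) -> tuple[str, dict]:
    """Return (url, json_body) for creating a prediction."""
    owner, name, version = parse_model_ref(ref)
    if version is None:
        # Latest version: owner and model name go into the path.
        url = f"{API_BASE}/v1/models/{owner}/{name}/predictions"
        return url, {"input": model_input}
    # Pinned version (assumed route): the version travels in the body.
    return f"{API_BASE}/v1/predictions", {"version": version, "input": model_input}


def auth_headers(env: Optional[Mapping[str, str]] = None) -> dict:
    """Build request headers from REPLICATE_API_TOKEN."""
    env = os.environ if env is None else env
    token = env.get("REPLICATE_API_TOKEN")
    if not token:
        raise RuntimeError("REPLICATE_API_TOKEN is not set")
    return {"Authorization": f"Bearer {token}", "Content-Type": "application/json"}
```

Under these assumptions, a bare ID such as "llava-13b" is rejected by parse_model_ref because it has no owner segment, which mirrors the first gotcha.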

Capabilities

Vision · Multimodal

About LLaVA 13B

LLaVA 13B is the original 13B-parameter LLaVA (Large Language-and-Vision Assistant) model. It is a multimodal vision-and-language model that combines a vision encoder with a language model for visual understanding tasks.