LLM Reference
Replicate API

Claude 3 Haiku on Replicate API

Claude 3 · Anthropic

Serverless

Last refreshed 2026-04-19. Next refresh: weekly.

Why use Claude 3 Haiku on Replicate API?

Replicate API offers Claude 3 Haiku with pay-as-you-go pricing at $0.25 per 1M input tokens and $1.25 per 1M output tokens. Replicate is a cloud platform for running machine learning models through a hosted API, so there is no infrastructure to provision.

Input / 1M: $0.25
Output / 1M: $1.25
Cache: Not sourced
Batch: Not sourced

Setup recipe

Python + curl
Install
pip install replicate
Auth
export REPLICATE_API_TOKEN=...
Call
import replicate
output = replicate.run(
    "claude-3-haiku",
    input={"prompt": "Hello"}
)
Model ID
claude-3-haiku

Request example

import replicate

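# Reads REPLICATE_API_TOKEN from the environment.
# Model reference format: "owner/model-name" (latest version) or "owner/model-name:version-hash".
# "claude-3-haiku" is the model ID as listed on this page; prefix it with the owner shown on the model's Replicate page.
output = replicate.run(
    "claude-3-haiku",
    input={"prompt": "Hello"}
)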
# Output is a list or generator depending on the model
print("".join(output))
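Because the return value may be a list, a generator, or (for some models and client versions) a plain string, it can help to normalize it before printing. The following is a minimal sketch, not part of the Replicate client; `collect_text` is a hypothetical helper name, and the assumption is that each chunk converts cleanly to text with `str()`.

```python
def collect_text(output):
    """Join model output into one string.

    Accepts a plain string, a list of chunks, or a generator of chunks.
    Assumption: every chunk is a string or converts to text via str().
    """
    if isinstance(output, str):
        # Already a single string; joining would work but is unnecessary.
        return output
    return "".join(str(chunk) for chunk in output)


# Stand-ins for the shapes replicate.run() might return.
print(collect_text(["Hel", "lo"]))
print(collect_text(chunk for chunk in ("Hel", "lo")))
```

In the request example, `print(collect_text(output))` would then replace `print("".join(output))`.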

Gotchas

  • Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
  • The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.
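The reference format and REST path described above can be sketched with the standard library alone. This is an illustration under stated assumptions, not the Replicate client's implementation: the helper names are hypothetical, the host `https://api.replicate.com` and the `Bearer` authorization header are not given on this page, and the request body reuses the `{"prompt": ...}` input from the examples. The request is only built, never sent.

```python
import json
import urllib.request
from typing import Optional, Tuple

# Assumption: API host is not stated on this page; only the /v1/... path is.
API_BASE = "https://api.replicate.com/v1"


def parse_model_ref(ref: str) -> Tuple[str, str, Optional[str]]:
    """Split "owner/model-name" or "owner/model-name:version" into parts."""
    path, _, version = ref.partition(":")
    owner, _, name = path.partition("/")
    if not owner or not name or "/" in name:
        raise ValueError(f"expected 'owner/model-name', got {ref!r}")
    return owner, name, version or None


def build_prediction_request(ref: str, prompt: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/models/{owner}/{model-name}/predictions."""
    owner, name, version = parse_model_ref(ref)
    if version is not None:
        # Pinned versions are outside the scope of this sketch.
        raise ValueError("this sketch only handles unpinned 'owner/model-name' references")
    url = f"{API_BASE}/models/{owner}/{name}/predictions"
    body = json.dumps({"input": {"prompt": prompt}}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {token}",  # assumption: bearer-token auth
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


req = build_prediction_request("meta/meta-llama-3-8b-instruct", "Hello", token="<your-token>")
print(req.get_method(), req.full_url)
```

In real use the token would come from `os.environ["REPLICATE_API_TOKEN"]`. A bare ID such as "claude-3-haiku" fails `parse_model_ref` because it has no owner segment.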

Compare Claude 3 Haiku Across Providers

Provider                           Input (per 1M)  Output (per 1M)
AWS Bedrock                        $0.25           $1.25
GCP Vertex AI                      $0.25           $1.25
Salesforce Einstein Generative AI
Anthropic                          $0.25           $1.25
OpenRouter                         $0.25           $1.25

Pricing

Type           Price (per 1M)
Input tokens   $0.25
Output tokens  $1.25
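To turn these per-million rates into a per-request figure, multiply each token count by its rate and divide by one million. A minimal sketch using the prices listed above; `estimate_cost_usd` is a hypothetical helper, and the token counts in the usage line are made-up illustrative values.

```python
# Rates from the pricing table above, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 0.25
OUTPUT_USD_PER_MTOK = 1.25


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Illustrative request: 2,000 input tokens and 500 output tokens.
print(f"${estimate_cost_usd(2_000, 500):.6f}")
```

Cache and batch pricing are not sourced on this page, so the sketch ignores any discounts they might bring.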

Capabilities

Vision · Multimodal · Reasoning · Structured Outputs · Code Execution

About Claude 3 Haiku

Claude 3 Haiku is available on AWS Bedrock.

FAQ

What does Claude 3 Haiku cost on Replicate API?

On Replicate API, Claude 3 Haiku costs $0.25 per 1M input tokens and $1.25 per 1M output tokens.

What is the context window for Claude 3 Haiku on Replicate API?

Claude 3 Haiku supports a 200,000-token context window on Replicate API.
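A quick budget check against that window might look like the sketch below. It assumes the prompt and the generated reply share the same 200,000-token window, and that token counts come from a tokenizer not shown here; both function names and the example numbers are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 200_000  # from the FAQ answer above


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """True if prompt plus requested output stays within the window.

    Assumption: prompt and output tokens count against one shared window.
    """
    return prompt_tokens + max_output_tokens <= window


def remaining_output_budget(prompt_tokens: int,
                            window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Tokens left for the reply after the prompt, never negative."""
    return max(window - prompt_tokens, 0)


# Illustrative values: a 150,000-token prompt with a 4,096-token reply cap.
print(fits_context(150_000, 4_096))
print(remaining_output_budget(150_000))
```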

How does Replicate API compare to other Claude 3 Haiku providers?

Claude 3 Haiku is available from 6 providers. Replicate API's $0.25/1M input price is the lowest listed, and it matches the input price listed for AWS Bedrock, GCP Vertex AI, Anthropic, and OpenRouter.

Who created Claude 3 Haiku?

Claude 3 Haiku was created by Anthropic as part of the Claude 3 model family.

Is Claude 3 Haiku open source?

No. Claude 3 Haiku is a proprietary model: Anthropic has not released its weights, and it is available only through hosted APIs.


Model Specs

Released: 2024-03-04
Parameters: 20B (unofficial estimate; Anthropic has not published a parameter count)
Context: 200K
Architecture: Decoder Only
Knowledge cutoff: 2023-08

Related Models on Replicate API