Claude 3.5 Sonnet on Replicate API

Claude 3.5 · Anthropic

Serverless

Last refreshed 2026-04-19. Next refresh: weekly.

Why use Claude 3.5 Sonnet on Replicate API?

Replicate API offers Claude 3.5 Sonnet with pay-as-you-go pricing at $3.00 per 1M input tokens and $15.00 per 1M output tokens. Replicate is a cloud platform for running machine learning models through a hosted API, so there is no infrastructure to provision.

Input / 1M tokens: $3.00
Output / 1M tokens: $15.00
Cache: Not sourced
Batch: Not sourced

Setup recipe

Python + curl
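Install:
pip install replicate

Auth:
export REPLICATE_API_TOKEN=...

Call:
import replicate
# replicate.run() expects an "owner/model-name" reference, so the listed
# model ID is prefixed with its owner; "anthropic" is assumed here.
output = replicate.run(
    "anthropic/claude-3.5-sonnet",
    input={"prompt": "Hello"}
)

Model ID:
claude-3.5-sonnet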
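
For callers who prefer plain HTTP over the client library, the same call can be sketched as a REST request. This is a minimal sketch, not Replicate's official example: it assumes the `/v1/models/{owner}/{model-name}/predictions` path and a bearer-token `Authorization` header, and the `anthropic` owner and the `r8_example` token are placeholders. The function name `build_prediction_request` is hypothetical. The request is built but not sent.

```python
import json
import urllib.request

API_BASE = "https://api.replicate.com"


def build_prediction_request(token, owner, name, prompt):
    """Build (but do not send) a POST to the model's predictions endpoint."""
    url = f"{API_BASE}/v1/models/{owner}/{name}/predictions"
    # The request body wraps the model inputs in an "input" object
    body = json.dumps({"input": {"prompt": prompt}}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# "anthropic" as the owner is an assumption; confirm it on the model page.
req = build_prediction_request(
    "r8_example", "anthropic", "claude-3.5-sonnet", "Hello"
)
print(req.get_method(), req.full_url)
```

With a real token in place of the placeholder, passing `req` to `urllib.request.urlopen` would submit the prediction.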

Request example

import replicate

# The client reads REPLICATE_API_TOKEN from the environment.
# Model reference format: "owner/model-name" (latest version) or
# "owner/model-name:version-hash" (pinned version).
# The "anthropic" owner is assumed here; confirm it on the model page.
output = replicate.run(
    "anthropic/claude-3.5-sonnet",
    input={"prompt": "Hello"}
)
# Output is a list or generator of text chunks depending on the model
print("".join(output))
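
Because the output shape varies by model, it can help to normalize it before printing. The helper below is a sketch under that assumption; `collect_text` is a hypothetical name, not part of the `replicate` package, and it is exercised here with stand-in values rather than a live API call.

```python
from collections.abc import Iterable


def collect_text(output):
    """Return model output as one string, whatever shape it arrived in."""
    if isinstance(output, str):
        return output
    if isinstance(output, Iterable):
        # Lists and generators of chunks: stringify each piece and join
        return "".join(str(chunk) for chunk in output)
    return str(output)


# Stand-ins for the shapes replicate.run() may return
print(collect_text(["Hel", "lo"]))
print(collect_text(chunk for chunk in ("Hel", "lo")))
print(collect_text("Hello"))
```

Checking for `str` first matters: a string is itself iterable, and returning it directly avoids a pointless character-by-character join.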

Gotchas

  • Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
  • The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.
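
Both gotchas can be checked before any request is made. The sketch below validates a model reference against the "owner/model-name" format and looks up the token under more than one variable name. The function names and `MY_APP_REPLICATE_TOKEN` are hypothetical, chosen only for illustration.

```python
import os


def parse_model_ref(ref):
    """Split "owner/model-name[:version]" into (owner, name, version or None)."""
    path, _, version = ref.partition(":")
    owner, sep, name = path.partition("/")
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f"expected 'owner/model-name', got {ref!r}")
    return owner, name, version or None


def resolve_token(env=os.environ, names=("REPLICATE_API_TOKEN",)):
    """Return the first non-empty token found under the given variable names."""
    for var in names:
        token = env.get(var)
        if token:
            return token
    raise RuntimeError(f"none of {names} is set")


print(parse_model_ref("meta/meta-llama-3-8b-instruct"))

try:
    parse_model_ref("claude-3.5-sonnet")  # no owner prefix
except ValueError as err:
    print("rejected:", err)

# MY_APP_REPLICATE_TOKEN is a hypothetical renamed variable
token = resolve_token(
    {"MY_APP_REPLICATE_TOKEN": "r8_example"},
    names=("REPLICATE_API_TOKEN", "MY_APP_REPLICATE_TOKEN"),
)
```

A token resolved this way can be handed to the client explicitly with `replicate.Client(api_token=token)` instead of relying on the default environment lookup.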

Compare Claude 3.5 Sonnet Across Providers

Provider            Input (per 1M)   Output (per 1M)
GCP Vertex AI       $3.00            $15.00
AWS Bedrock         $3.00            $15.00
Anthropic           $3.00            $15.00
OpenRouter          (no price listed)
Microsoft Foundry   (no price listed)

Pricing

Type            Price (per 1M)
Input tokens    $3.00
Output tokens   $15.00
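
As a worked example of these rates, the cost of a single request can be estimated from its token counts. This sketch assumes the caller already knows the counts; it hard-codes the two prices listed here and ignores any cache or batch pricing, which this page does not source.

```python
INPUT_USD_PER_M = 3.00    # listed price per 1M input tokens
OUTPUT_USD_PER_M = 15.00  # listed price per 1M output tokens


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request in US dollars."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# A 200,000-token prompt with a 4,000-token reply
cost = estimate_cost_usd(200_000, 4_000)
print(f"${cost:.2f}")  # prints $0.66
```

Output tokens cost five times as much as input tokens, so long replies dominate the bill faster than long prompts do.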

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Structured Outputs, Code Execution

About Claude 3.5 Sonnet

Claude 3.5 Sonnet is a large language model from Anthropic that combines strong reasoning, coding, and natural language understanding with multimodal (text and image) input. It was first released in June 2024, with an upgraded version following in October 2024, and at release it outperformed Anthropic's earlier models on the company's published benchmarks. Anthropic has not published details of the model's architecture. The model supports applications from software development and data analysis to customer support and content creation. On Claude.ai, the "Artifacts" feature displays generated code and documents in a panel beside the conversation for iterative editing; the model is also available through APIs.

FAQ

What does Claude 3.5 Sonnet cost on Replicate API?

On Replicate API, Claude 3.5 Sonnet costs $3 per 1M input tokens and $15 per 1M output tokens.

What is the context window for Claude 3.5 Sonnet on Replicate API?

Claude 3.5 Sonnet supports a 200,000 token context window on Replicate API.
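
A request can be checked against this limit before it is sent. The sketch below assumes that prompt tokens and generated tokens share the same 200,000-token window and that the caller supplies token counts from its own tokenizer or usage data. The function names are hypothetical, and any separate cap on output length is not covered because this page does not state one.

```python
CONTEXT_WINDOW = 200_000  # tokens, as stated above


def remaining_output_budget(input_tokens, context_window=CONTEXT_WINDOW):
    """Tokens left for the reply; 0 if the prompt alone fills the window."""
    return max(context_window - input_tokens, 0)


def fits_in_context(input_tokens, max_output_tokens,
                    context_window=CONTEXT_WINDOW):
    """True if the prompt plus the requested reply length fit the window."""
    return input_tokens + max_output_tokens <= context_window


print(remaining_output_budget(150_000))  # prints 50000
print(fits_in_context(198_000, 4_000))   # prints False
```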

How does Replicate API compare to other Claude 3.5 Sonnet providers?

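Claude 3.5 Sonnet is available from 6 providers. The lowest listed input price is $3/1M tokens, and Replicate API, GCP Vertex AI, AWS Bedrock, and Anthropic all charge it, so none of them is cheaper than the others on input.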
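
Several providers list the same input price, so a comparison should report ties rather than a single winner. The sketch below uses the input prices shown on this page, with `None` for providers that have no listed price; the function name `cheapest` is hypothetical.

```python
# Input price in USD per 1M tokens; None means no price is listed here
input_prices = {
    "Replicate API": 3.00,
    "GCP Vertex AI": 3.00,
    "AWS Bedrock": 3.00,
    "Anthropic": 3.00,
    "OpenRouter": None,
    "Microsoft Foundry": None,
}


def cheapest(prices):
    """Return the lowest known price and every provider that charges it."""
    known = {name: p for name, p in prices.items() if p is not None}
    low = min(known.values())
    return low, sorted(name for name, p in known.items() if p == low)


low, providers = cheapest(input_prices)
print(low, providers)
```

Providers without a listed price are skipped rather than treated as free or as more expensive.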

Who created Claude 3.5 Sonnet?

Claude 3.5 Sonnet was created by Anthropic as part of the Claude 3.5 model family.

Is Claude 3.5 Sonnet open source?

No. Claude 3.5 Sonnet is a proprietary model. Anthropic has not released its weights, so it is available only through hosted APIs such as the providers listed above.

Model Specs

Released: 2024-06-20
Parameters: 70B (unofficial; Anthropic has not published a parameter count)
Context: 200K
Architecture: Decoder Only
Knowledge cutoff: 2024-04
