GPT-4 Turbo on Replicate API

Name: GPT-4 Turbo on Replicate API
Brand: OpenAI
SKU: gpt-4-turbo-replicate
Price: 5 USD

GPT-4 · OpenAI

Serverless

Last refreshed 2026-05-10. Next refresh: weekly.

Why use GPT-4 Turbo on Replicate API?

Replicate API offers GPT-4 Turbo with pay-as-you-go pricing at $5.00/1M input tokens. Replicate is a cloud-based platform that enables users to run machine learning models easily and efficiently.

Compare GPT-4 Turbo across 6 providers to find the best fit for your use case

Input / 1M

$5.00

Output / 1M

$15.00

Cache

Not sourced

Batch

Not sourced

Setup recipe

Python + curl

Install

pip install replicate

Auth

export REPLICATE_API_TOKEN=...

Call

import replicate
output = replicate.run(
    "gpt-4-turbo",
    input={"prompt": "Hello"}

Model ID

gpt-4-turbo

Request example

import replicate

# reads REPLICATE_API_TOKEN from env
# gpt-4-turbo format: "owner/model-name" (latest version) or "owner/model-name:version-hash"
output = replicate.run(
    "gpt-4-turbo",
    input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))

Gotchas

Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.