Last refreshed 2026-04-15. Next refresh: weekly.
Why use Flan-T5 XL on Replicate API?
Replicate API offers Flan-T5 XL with competitive pricing. Replicate is a cloud-based platform that enables users to run machine learning models easily and efficiently.
Compare Flan-T5 XL across 2 providers to find the best fit for your use caseInput / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Python + curlInstall
pip install replicateAuth
export REPLICATE_API_TOKEN=...Call
import replicate
output = replicate.run(
"flan-t5-xl",
input={"prompt": "Hello"}Model ID
flan-t5-xlRequest example
import replicate
# reads REPLICATE_API_TOKEN from env
# flan-t5-xl format: "owner/model-name" (latest version) or "owner/model-name:version-hash"
output = replicate.run(
"flan-t5-xl",
input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))Gotchas
- Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
- The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.
Compare Flan-T5 XL Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| IBM watsonx | $0.60 | $0.60 |
| Replicate API | — | — |
Capabilities
No model capability flags are currently sourced.
FAQ
How does Replicate API compare to other Flan-T5 XL providers?
Flan-T5 XL is available from 2 providers. The cheapest input pricing is $0.6/1M tokens from IBM watsonx.
Who created Flan-T5 XL?
Flan-T5 XL was created by Google DeepMind as part of the FLAN-T5 model family.
Is Flan-T5 XL open source?
Flan-T5 XL's open source status is unknown in the seed data.
Get Started
Model Specs
Released2022-10-03
Parameters3B
ArchitectureDecoder Only