LLM ReferenceLLM Reference
Replicate API

Phi-2 on Replicate API

Phi-2 · Microsoft Research

ServerlessOpen Source

Last refreshed 2026-04-19. Next refresh: weekly.

Why use Phi-2 on Replicate API?

Replicate API offers Phi-2 with pay-as-you-go pricing at $0.05/1M input tokens. Replicate is a cloud-based platform that enables users to run machine learning models easily and efficiently.

Compare Phi-2 across 5 providers to find the best fit for your use case
Input / 1M
$0.050
Output / 1M
$0.25
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install replicate
Auth
export REPLICATE_API_TOKEN=...
Call
import replicate
output = replicate.run(
    "phi-2",
    input={"prompt": "Hello"}
Model ID
phi-2

Request example

import replicate

# reads REPLICATE_API_TOKEN from env
# phi-2 format: "owner/model-name" (latest version) or "owner/model-name:version-hash"
output = replicate.run(
    "phi-2",
    input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))

Gotchas

  • Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
  • The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.

Compare Phi-2 Across Providers

ProviderInput (per 1M)Output (per 1M)
Microsoft Foundry$0.07$0.07
Cloudflare Workers AI
Together AI$0.10$0.10
Fireworks AI$0.10$0.10
Replicate API$0.05$0.25

Pricing

TypePrice (per 1M)
Input tokens$0.05
Output tokens$0.25

Capabilities

Structured Outputs

About Phi-2

Phi-2 is a compact language model by Microsoft endowed with 2.7 billion parameters and part of their Phi series. It shows formidable capabilities in reasoning and language understanding, outshining much larger models, even those with up to 25 times more parameters. Phi-2's training utilized a vast and diverse dataset of 1.4 trillion tokens, incorporating high-quality synthetic data and curated web content to bolster its common sense reasoning and general knowledge. Interestingly, despite lacking fine-tuning via reinforcement learning from human feedback (RLHF), it exhibits enhanced safety features and reduced bias. This makes Phi-2 a particularly useful asset in natural language processing research and development 127.

FAQ

What does Phi-2 cost on Replicate API?

On Replicate API, Phi-2 costs $0.05 per 1M input tokens and $0.25 per 1M output tokens.

How does Replicate API compare to other Phi-2 providers?

Phi-2 is available from 5 providers. The cheapest input pricing is $0.05/1M tokens from Replicate API.

Who created Phi-2?

Phi-2 was created by Microsoft Research as part of the Phi-2 model family.

Is Phi-2 open source?

Phi-2 is open source according to the seed data.

Get Started