Phi-3 Medium 128K
phi-3-medium-128k
Last refreshed 2026-05-16. Next refresh: weekly.
Phi-3 Medium 128K is worth evaluating for coding, long context, and classification when its provider route and context window match the workload.
Decision context: Coding task fit, 2 tracked provider routes, and research from 2026-01-01.
Use it for
- Teams evaluating coding, long context, and classification
- Workloads that can use a 128K context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest output
$1.50
Microsoft Foundry per 1M tokens
Provider routes
2
Tracked API hosts
Quality / dollar
Grade C
Ranked by benchmark score divided by cheapest output price
Freshness
2026-01-01
Researched 144d ago
Top use-case fit
Coding
Q/$ C1 relevant benchmark in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Classification
Q/$ D2 relevant benchmarks in the decision map.
Provider price ladder
Compare all 2| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Microsoft Foundry | $0.500 | $1.50 | ServerlessProvisioned |
| NVIDIA NIM | - | - | ProvisionedPartial |
Benchmark peer barsfor Coding
Migration checks
No linked migration route is available for this model yet.
About
The Phi-3 Medium 128K is an open-source, 14-billion parameter language model by Microsoft, designed for efficient operation in resource-limited environments. Noted for its state-of-the-art performance on reasoning tasks, it excels in language understanding, code generation, and logical reasoning while offering a long context window of up to 128,000 tokens, making it ideal for applications like summarizing lengthy documents. Its dense decoder-only Transformer architecture has been refined with supervised fine-tuning and preference optimization to enhance instruction-following capabilities. Additionally, Phi-3 Medium 128K is optimized for diverse hardware platforms, ensuring broad accessibility and performance 12.
Phi-3 Medium 128K has a 128K-token context window.
Phi-3 Medium 128K input tokens at $0.5/1M, output at $1.5/1M.
Capabilities
No model capability flags are currently sourced.
Benchmark Scores(3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| HumanEval | 52.2 | pass@1 | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| Massive Multitask Language Understanding | 75.3 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| MMLU PRO | 51.9 | — | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro |