LLM Reference

Phi-3 Medium 128K

phi-3-medium-128k

Researched 144d ago

Last refreshed 2026-05-16. Next refresh: weekly.

Open SourceCodingLong contextClassification

Phi-3 Medium 128K is worth evaluating for coding, long context, and classification when its provider route and context window match the workload.

Decision context: Coding task fit, 2 tracked provider routes, and research from 2026-01-01.

Use it for

  • Teams evaluating coding, long context, and classification
  • Workloads that can use a 128K context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Cheapest output

$1.50

Microsoft Foundry per 1M tokens

Provider routes

2

Tracked API hosts

Quality / dollar

Grade C

Ranked by benchmark score divided by cheapest output price

Freshness

2026-01-01

Researched 144d ago

stale

Top use-case fit

Coding

Q/$ C

1 relevant benchmark in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Q/$ D

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 2
ProviderInput / 1MOutput / 1MRoute
Microsoft Foundry$0.500$1.50
ServerlessProvisioned
NVIDIA NIM--
ProvisionedPartial

Benchmark peer barsfor Coding

Migration checks

No linked migration route is available for this model yet.

About

The Phi-3 Medium 128K is an open-source, 14-billion parameter language model by Microsoft, designed for efficient operation in resource-limited environments. Noted for its state-of-the-art performance on reasoning tasks, it excels in language understanding, code generation, and logical reasoning while offering a long context window of up to 128,000 tokens, making it ideal for applications like summarizing lengthy documents. Its dense decoder-only Transformer architecture has been refined with supervised fine-tuning and preference optimization to enhance instruction-following capabilities. Additionally, Phi-3 Medium 128K is optimized for diverse hardware platforms, ensuring broad accessibility and performance 12.

Phi-3 Medium 128K has a 128K-token context window.

Phi-3 Medium 128K input tokens at $0.5/1M, output at $1.5/1M.

Capabilities

No model capability flags are currently sourced.

Benchmark Scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
HumanEval52.2pass@1https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding75.35-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
MMLU PRO51.9https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

FamilyPhi-3
Released2024-05-21
Parameters14B
Context128K
ArchitectureDecoder Only
Knowledge cutoff2023-10
Specializationgeneral
Trainingfinetuned

Created by

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States
Founded 1991
Website