LLM Reference

Phi-3 Mini 128K

phi-3-mini-128k

Researched 144d ago

Last refreshed 2026-05-16. Next refresh: weekly.

Open SourceCodingLong contextClassification

Phi-3 Mini 128K is worth evaluating for coding, long context, and classification when its provider route and context window match the workload.

Decision context: Coding task fit, 5 tracked provider routes, and research from 2026-01-01.

Use it for

  • Teams evaluating coding, long context, and classification
  • Workloads that can use a 128K context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Cheapest output

$0.100

Fireworks AI per 1M tokens

Provider routes

5

Tracked API hosts

Quality / dollar

Grade A

Ranked by benchmark score divided by cheapest output price

Freshness

2026-01-01

Researched 144d ago

stale

Top use-case fit

Coding

Q/$ A

1 relevant benchmark in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Q/$ B

3 relevant benchmarks in the decision map.

Provider price ladder

Compare all 5
ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.100$0.100
Provisioned
Replicate API$0.050$0.250
Serverless
Microsoft Foundry$0.300$0.900
ServerlessProvisioned
Baseten API--
ServerlessPartial

Benchmark peer barsfor Coding

Migration checks

No linked migration route is available for this model yet.

About

Phi-3 Mini-128K-Instruct, developed by Microsoft, is a 3.8 billion-parameter large language model renowned for its lightweight, open-source architecture. Despite its modest size, it excels in reasoning tasks, particularly in math and logic, and showcases strong code generation capabilities. A standout feature is its remarkable ability to handle up to 128,000 tokens, allowing it to process extensive text documents and codebases efficiently. While it has limitations in factual knowledge and focuses primarily on English, it strikes a balance between performance and efficiency, making it ideal for resource-constrained environments. The model is available on platforms like Azure AI Studio and Hugging Face and benefits from training on high-quality synthetic and publicly available data, with fine-tuning to improve instruction adherence and safety.

Phi-3 Mini 128K has a 128K-token context window.

Phi-3 Mini 128K input tokens at $0.05/1M, output at $0.25/1M.

Capabilities

No model capability flags are currently sourced.

Benchmark Scores(5)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Google-Proof Q&A50.8diamondresearch
HellaSwag90.210-shotresearch
HumanEval75.9pass@1research
Massive Multitask Language Understanding76.55-shotresearch
MMLU PRO43.9https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

FamilyPhi-3
Released2024-04-23
Parameters3.8B
Context128K
ArchitectureDecoder Only
Knowledge cutoff2023-10
Specializationgeneral
Trainingfinetuned

Created by

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States
Founded 1991
Website