Phi-4 Mini
phi-4-mini
Last refreshed 2026-05-11. Next refresh: weekly.
Phi-4 Mini is worth evaluating for classification when its provider route and context window match the workload.
Decision context: Classification task fit, 3 tracked provider routes, and research from 2026-01-01.
Use it for
- Teams evaluating classification
- Buyers comparing 3 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest output
$0.150
Novita AI per 1M tokens
Provider routes
3
Tracked API hosts
Quality / dollar
Grade B
Ranked by benchmark score divided by cheapest output price
Freshness
2026-01-01
Researched 137d ago
Top use-case fit
Classification
Q/$ B2 relevant benchmarks in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Novita AI | $0.050 | $0.150 | Serverless |
| Fireworks AI | $0.900 | $0.900 | Serverless |
| NVIDIA NIM | - | - | ServerlessPartial |
Benchmark peer barsfor Classification
Migration checks
No linked migration route is available for this model yet.
About
Phi-4 family model from Microsoft Research. Mini variant with efficient performance.
Phi-4 Mini input tokens at $0.05/1M, output at $0.15/1M.
Capabilities
No model capability flags are currently sourced.
Benchmark Scores(3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 25.2 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
| Massive Multitask Language Understanding | 67.3 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
| MMLU PRO | 52.8 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
Created by
Advancing the state-of-the-art in AI and computing.