Phi-4 Mini
Phi-4 Mini is worth evaluating for long context and classification when its provider route and context window match the workload.
Use it for
- Teams evaluating long-context and classification workloads
- Workloads that can use a 128k context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Advancing the state-of-the-art in AI and computing.
Cheapest of 3 routes · Fireworks AI
About
The Mini variant of the Phi-4 model family from Microsoft Research, positioned for efficient performance.
Phi-4 Mini is an open-source model in the Phi-4 family. The structured metadata tracks a 128k-token context window. This page tracks provider routes through Fireworks AI, NVIDIA NIM, and Novita AI, with the cheapest tracked route listed at $0.05 input and $0.15 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 25.2, Massive Multitask Language Understanding 67.3, and MMLU PRO 52.8.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Classification
Q/$ C2 relevant benchmarks in the decision map.
Provider price ladder
Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.900 | $0.900 | Serverless |
| NVIDIA NIM | - | - | Serverless (Partial) |
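To see what per-1M-token prices mean for a single request, here is a minimal sketch. It uses the two price points quoted on this page: the Fireworks AI serverless row, and the $0.05 input / $0.15 output cheapest tracked route. The `Route` class, the `request_cost` function, and the example request size (100,000 input tokens, 1,000 output tokens) are illustrative assumptions, not part of any provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    """A provider route with prices in USD per 1M tokens (hypothetical helper type)."""
    name: str
    input_per_1m: float
    output_per_1m: float


def request_cost(route: Route, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request on the given route."""
    total = input_tokens * route.input_per_1m + output_tokens * route.output_per_1m
    return total / 1_000_000


# Prices copied from this page; the request size below is an assumption.
ROUTES = [
    Route("Fireworks AI (serverless)", 0.90, 0.90),
    Route("Cheapest tracked route", 0.05, 0.15),
]

if __name__ == "__main__":
    for route in ROUTES:
        cost = request_cost(route, input_tokens=100_000, output_tokens=1_000)
        print(f"{route.name}: ${cost:.4f} per request")
```

Under these assumptions, a long-context request costs roughly $0.09 on the Fireworks AI row and roughly $0.005 on the cheapest tracked route, so input pricing dominates when prompts are large and outputs are short.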
Available via routers & gateways (2)
OpenRouter
Hybrid · Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
NVIDIA LLM Router Blueprint
Router · NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; scheduled for deprecation on 2026-06-20.
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Classification
Benchmark scores (3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 25.2 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
| Massive Multitask Language Understanding | 67.3 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
| MMLU PRO | 52.8 | — | https://huggingface.co/microsoft/Phi-4-mini-instruct |
Migration checks
No linked migration route is available for this model yet.
Rankings & picks (1)
Frequently asked questions
What is the context window of Phi-4 Mini?
Phi-4 Mini has a context window of 128k tokens.
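A rough pre-flight check can indicate whether a prompt is likely to fit in that window before a request is sent. The sketch below rests on two loudly labeled assumptions: "128k" is read as 128,000 tokens, and token counts are approximated at about 4 characters per token instead of being measured with the model's actual tokenizer. The function names and the reserved output budget are hypothetical.

```python
CONTEXT_WINDOW = 128_000  # assumption: "128k" read as 128,000 tokens
CHARS_PER_TOKEN = 4       # assumption: rough heuristic, not the model's tokenizer


def estimate_tokens(text: str) -> int:
    """Approximate the token count from character length (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_context(prompt: str, reserved_output_tokens: int = 1_000) -> bool:
    """Return True if the estimated prompt plus the output budget fits the window."""
    return estimate_tokens(prompt) + reserved_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    sample = "word " * 50_000  # 250,000 characters of filler text
    print(estimate_tokens(sample), fits_context(sample))
```

For a real deployment, the estimate would ideally be replaced with a count from the tokenizer shipped with the model, since character-based heuristics can be well off for code or non-English text.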
How much does Phi-4 Mini cost?
Phi-4 Mini pricing ranges from $0.05 to $0.90 per 1M input tokens, depending on the provider.
When was Phi-4 Mini released?
Phi-4 Mini was released on 2024-12-13.
Which providers offer Phi-4 Mini?
Phi-4 Mini is available from 3 providers: Fireworks AI, NVIDIA NIM, and Novita AI.
What benchmarks has Phi-4 Mini been tested on?
Phi-4 Mini has been evaluated on 3 tracked benchmarks: Google-Proof Q&A, Massive Multitask Language Understanding, and MMLU PRO.