LLM Reference

Phi-4 Models by Microsoft Research

9 models · 2024–2026 · Up to 128k context · From $0.05/1M input tokens

About

Phi-4 is a family of 9 AI models by Microsoft Research, released between 2024 and 2026.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
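
The page does not say how the guidance sentences are generated. As a rough sketch only, the following assumes each model carries an ordered list of capability tags and that the sentence is built by joining them; the function name `use_when` and the tag lists are hypothetical, chosen to reproduce the wording used on this page.

```python
def use_when(needs: list[str]) -> str:
    """Join capability tags into one 'Use when' sentence (hypothetical helper)."""
    if not needs:
        raise ValueError("at least one capability tag is required")
    if len(needs) == 1:
        joined = needs[0]
    elif len(needs) == 2:
        joined = f"{needs[0]} and {needs[1]}"
    else:
        # Three or more tags: comma-separated, with a serial comma before "and".
        joined = ", ".join(needs[:-1]) + f", and {needs[-1]}"
    return f"Use when the workload needs {joined}."


# Tag lists copied from two of the cards on this page.
print(use_when(["reasoning", "128k context", "3.8B parameters"]))
print(use_when(["15B parameters", "multimodal inputs"]))
```

Because the sentence only restates the tags, two models with identical tags get identical guidance, so the specifications table is needed to tell them apart.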

Phi-4 Mini Reasoning (Current)

Use when the workload needs reasoning, 128k context, and 3.8B parameters.

2026-05 · reasoning · 128k context · 3.8B parameters

Phi-4 Reasoning Vision 15B (Current)

Use when the workload needs 15B parameters and multimodal inputs.

2026-03 · 15B parameters · multimodal inputs

Phi-4 Mini Flash Reasoning (Current)

Use when the workload needs reasoning, 128k context, and 3.8B parameters.

2025-12 · reasoning · 128k context · 3.8B parameters

Phi 4 Multimodal Instruct (Current)

Use when the workload needs multimodal support, 128k context, and 5.6B parameters.

2025-01 · multimodal · 128k context · 5.6B parameters

Phi 4 Reasoning (Current)

Use when the workload needs reasoning, 128k context, and 14B parameters.

2025-01 · reasoning · 128k context · 14B parameters

Phi 4 Reasoning Plus (Current)

Use when the workload needs reasoning, 128k context, and 14B parameters.

2025-01 · reasoning · 128k context · 14B parameters

Phi-4 Multimodal (Current)

Use when the workload needs 128k context, 5.6B parameters, and multimodal inputs.

2025-01 · 128k context · 5.6B parameters · multimodal inputs

Phi-4 14B (Current)

Use when the workload needs 16k context, 14B parameters, and structured outputs.

2024-12 · 16k context · 14B parameters · structured outputs

Phi-4 Mini (Current)

Use when the workload needs 128k context and 3.8B parameters.

2024-12 · 128k context · 3.8B parameters

Release Timeline

5 release groups

2026-05 (1 current)
Phi-4 Mini Reasoning (Current): reasoning · 128k context · 3.8B parameters

2026-03 (1 current)
Phi-4 Reasoning Vision 15B (Current): 15B parameters · multimodal inputs

2025-12 (1 current)
Phi-4 Mini Flash Reasoning (Current): reasoning · 128k context · 3.8B parameters

2025-01 (4 current)
Phi 4 Multimodal Instruct (Current): multimodal · 128k context · 5.6B parameters
Phi 4 Reasoning (Current): reasoning · 128k context · 14B parameters
Phi 4 Reasoning Plus (Current): reasoning · 128k context · 14B parameters
Phi-4 Multimodal (Current): 128k context · 5.6B parameters · multimodal inputs

2024-12 (2 current)
Phi-4 14B (Current): 16k context · 14B parameters · structured outputs
Phi-4 Mini (Current): 128k context · 3.8B parameters

Specifications (9 models)

Phi-4 model specifications comparison
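| Model | Released | Context | Parameters | Vision | Multimodal | Reasoning | Structured Outputs |
|---|---|---|---|---|---|---|---|
| Phi-4 Mini Reasoning | 2026-05 | 128k | 3.8B | No | No | Yes | No |
| Phi-4 Reasoning Vision 15B | 2026-03 | not listed | 15B | No | Yes | No | No |
| Phi-4 Mini Flash Reasoning | 2025-12 | 128k | 3.8B | No | No | Yes | No |
| Phi 4 Multimodal Instruct | 2025-01 | 128k | 5.6B | Yes | Yes | No | No |
| Phi 4 Reasoning Plus | 2025-01 | 128k | 14B | No | No | Yes | No |
| Phi 4 Reasoning | 2025-01 | 128k | 14B | No | No | Yes | No |
| Phi-4 Multimodal | 2025-01 | 128k | 5.6B | No | Yes | No | No |
| Phi-4 14B | 2024-12 | 16k | 14B | No | No | No | Yes |
| Phi-4 Mini | 2024-12 | 128k | 3.8B | No | No | No | No |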
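
To narrow the table down to candidates for a given workload, it can be loaded into a small data structure and filtered. The sketch below is illustrative only: the `Spec` type and `find_models` helper are hypothetical names, the rows are transcribed from the table above, and the missing context value for Phi-4 Reasoning Vision 15B is stored as `None` rather than guessed.

```python
from typing import NamedTuple, Optional


class Spec(NamedTuple):
    model: str
    released: str
    context_k: Optional[int]  # None where the table lists no context length
    params: str
    vision: bool
    multimodal: bool
    reasoning: bool
    structured_outputs: bool


# Rows transcribed from the specifications table on this page.
SPECS = [
    Spec("Phi-4 Mini Reasoning", "2026-05", 128, "3.8B", False, False, True, False),
    Spec("Phi-4 Reasoning Vision 15B", "2026-03", None, "15B", False, True, False, False),
    Spec("Phi-4 Mini Flash Reasoning", "2025-12", 128, "3.8B", False, False, True, False),
    Spec("Phi 4 Multimodal Instruct", "2025-01", 128, "5.6B", True, True, False, False),
    Spec("Phi 4 Reasoning Plus", "2025-01", 128, "14B", False, False, True, False),
    Spec("Phi 4 Reasoning", "2025-01", 128, "14B", False, False, True, False),
    Spec("Phi-4 Multimodal", "2025-01", 128, "5.6B", False, True, False, False),
    Spec("Phi-4 14B", "2024-12", 16, "14B", False, False, False, True),
    Spec("Phi-4 Mini", "2024-12", 128, "3.8B", False, False, False, False),
]


def find_models(min_context_k: int = 0, **required: bool) -> list[str]:
    """Return model names meeting a context floor and every required flag."""
    matches = []
    for spec in SPECS:
        # A model with no listed context cannot satisfy a context requirement.
        if min_context_k and (spec.context_k is None or spec.context_k < min_context_k):
            continue
        if all(getattr(spec, flag) == value for flag, value in required.items()):
            matches.append(spec.model)
    return matches


print(find_models(min_context_k=128, reasoning=True))
print(find_models(structured_outputs=True))
```

Treating an unlisted context length as a non-match is a conservative choice; a caller who prefers to keep such models in view could filter on the flags alone.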

Available From (5 providers)

Pricing

Phi-4 model pricing by provider
| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Type |
|---|---|---|---|---|
| Phi-4 Mini | Novita AI | $0.05 | $0.15 | Serverless |
| Phi-4 14B | OpenRouter | $0.065 | $0.14 | Serverless |
| Phi 4 Reasoning Plus | Microsoft Foundry | $0.125 | $0.5 | Serverless |
| Phi 4 Reasoning Plus | Fireworks AI | $0.5 | $0.5 | Serverless |
| Phi 4 Reasoning | Fireworks AI | $0.5 | $0.5 | Serverless |
| Phi-4 14B | Fireworks AI | $0.9 | $0.9 | Serverless |
| Phi-4 Mini | Fireworks AI | $0.9 | $0.9 | Serverless |
| Phi 4 Multimodal Instruct | Fireworks AI | $0.9 | $0.9 | Serverless |
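
Since prices are quoted per 1M tokens, the cost of a workload is the input and output token counts, each divided by one million and multiplied by the matching rate. The sketch below applies that arithmetic to the two listed offers for Phi 4 Reasoning Plus; the function name and the token volumes are assumptions for illustration, not figures from this page.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Cost in dollars, with both prices quoted per 1M tokens."""
    return ((input_tokens / 1_000_000) * input_price
            + (output_tokens / 1_000_000) * output_price)


# Listed prices for Phi 4 Reasoning Plus: (input $/1M, output $/1M).
OFFERS = {
    "Microsoft Foundry": (0.125, 0.5),
    "Fireworks AI": (0.5, 0.5),
}

# Hypothetical workload: 10M input tokens and 2M output tokens.
for provider, (input_price, output_price) in OFFERS.items():
    cost = estimate_cost(10_000_000, 2_000_000, input_price, output_price)
    print(f"{provider}: ${cost:.2f}")
```

For this input-heavy example the lower input rate dominates the total, which is why the input and output columns are worth weighing against the expected token mix rather than comparing a single number.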

Frequently Asked Questions

What is Phi-4 used for?
Phi-4 is used for reasoning, vision, and multimodal work. The family description and the listed model capabilities point to those workloads as the best fit.
How does Phi-4 compare to Harrier?
Phi-4 by Microsoft Research is strongest for reasoning workloads, while Harrier, also by Microsoft Research, is the closest related family to consider for embedding. Phi-4 has 9 listed variants and reaches up to 128k context, whereas Harrier reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.
Which Phi-4 model should I use?
For the lowest listed input price, start with Phi-4 Mini through Novita AI at $0.05 per 1M input tokens. For the most capable local choice, evaluate Phi 4 Multimodal Instruct, which offers 128k context and multimodal inputs.
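
The lowest-price recommendation can be checked against the pricing listing with a simple minimum over the input column. This is a minimal sketch: the rows are transcribed from the pricing table on this page, and `cheapest_by_input` is a hypothetical helper name.

```python
# (model, provider, input $/1M tokens, output $/1M tokens)
PRICING = [
    ("Phi-4 Mini", "Novita AI", 0.05, 0.15),
    ("Phi-4 14B", "OpenRouter", 0.065, 0.14),
    ("Phi 4 Reasoning Plus", "Microsoft Foundry", 0.125, 0.5),
    ("Phi 4 Reasoning Plus", "Fireworks AI", 0.5, 0.5),
    ("Phi 4 Reasoning", "Fireworks AI", 0.5, 0.5),
    ("Phi-4 14B", "Fireworks AI", 0.9, 0.9),
    ("Phi-4 Mini", "Fireworks AI", 0.9, 0.9),
    ("Phi 4 Multimodal Instruct", "Fireworks AI", 0.9, 0.9),
]


def cheapest_by_input(rows, model=None):
    """Return the row with the lowest input price, optionally for one model."""
    candidates = [row for row in rows if model is None or row[0] == model]
    if not candidates:
        raise ValueError(f"no pricing rows for model {model!r}")
    return min(candidates, key=lambda row: row[2])


print(cheapest_by_input(PRICING))               # cheapest across the family
print(cheapest_by_input(PRICING, "Phi-4 14B"))  # cheapest offer for one model
```

Passing a model name restricts the search to that model's offers, which is useful once the specifications have already fixed which variant is needed.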

Models (9)