LLM Reference

Phi-3 Models by Microsoft Research

Microsoft ResearchMITOpen Source
15 models2024Up to 128K ctxFrom $0.05/1M input

About

The Phi-3 family, developed by Microsoft, consists of small language models (SLMs) optimized for Azure AI 1. These models are known for their capability and cost-effectiveness, outperforming larger models in tasks such as language processing, reasoning, coding, and math 1. The Phi-3 lineup includes models like the Phi-3-mini with 3.8 billion parameters, and the Phi-3-small and Phi-3-medium, each with 7 billion and 14 billion parameters, respectively 1. Notably, the Phi-3-mini supports up to a 128K token context window with minimal quality impact, a feature rare for models of its size 1. These models are instruction-tuned for straightforward usage and are optimized for various processing platforms, including GPUs, CPUs, and even mobile hardware 1. Despite their prowess, the Phi-3 models might perform slightly below larger models on factual knowledge tests due to their relatively smaller size 1.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

15 in view

Use when the workload needs 128K context and 3.8B parameters.

2024-08128K context3.8B parameters

Use when the workload needs 128K context.

2024-08128K context

Use when the workload needs 128K context, 4.1B parameters, and multimodal inputs.

2024-08128K context4.1B parametersmultimodal inputs

Use when the workload needs 3.3B parameters.

2024-063.3B parameters

Use when the workload needs 128K context and 14B parameters.

2024-05128K context14B parameters

Use when the workload needs 4K context, 14B parameters, and structured outputs.

2024-054K context14B parametersstructured outputs

Use when the workload needs 128K context and 7B parameters.

2024-05128K context7B parameters

Use when the workload needs 8K context and 7B parameters.

2024-058K context7B parameters

Use when the workload needs 128K context, 4.2B parameters, and multimodal inputs.

2024-05128K context4.2B parametersmultimodal inputs

Use when the workload needs 4K context, 3.8B parameters, and structured outputs.

2024-054K context3.8B parametersstructured outputs

Use when the workload needs 128K context, 7B parameters, and structured outputs.

2024-05128K context7B parametersstructured outputs

Use when the workload needs 128K context and 3.8B parameters.

2024-04128K context3.8B parameters

Use when the workload needs 4K context and 3.8B parameters.

2024-044K context3.8B parameters
Phi-3 MiniCurrent

Use when the workload needs 4K context and 3.8B parameters.

2024-044K context3.8B parameters

Use when the workload needs 4K context and 14B parameters.

2024-044K context14B parameters

Release Timeline

4 release groups
2024-08
3 current
Phi 3.5 Mini Instruct
128K context3.8B parameters
Current
Current
Phi 3.5 Vision Instruct
128K context4.1B parametersmultimodal inputs
Current
2024-06
1 current
Phi-3 Silica
3.3B parameters
Current
2024-05
7 current
DeepInfra Phi 3 Mini 4K Instruct
4K context3.8B parametersstructured outputs
Current
DeepInfra Phi 3 Small 128K Instruct
128K context7B parametersstructured outputs
Current
Phi-3 Medium 128K
128K context14B parameters
Current
Phi-3 Medium 4K
4K context14B parametersstructured outputs
Current
Phi-3 Small 128K
128K context7B parameters
Current
Phi-3 Small 8K
8K context7B parameters
Current
Phi-3 Vision
128K context4.2B parametersmultimodal inputs
Current
2024-04
4 current
Phi-3 Medium
4K context14B parameters
Current
Phi-3 Mini
4K context3.8B parameters
Current
Phi-3 Mini 128K
128K context3.8B parameters
Current
Phi-3 Mini 4k
4K context3.8B parameters
Current

Specifications(15 models)

Phi-3 model specifications comparison
ModelReleasedContextParametersVisionMultimodalStructured Outputs
Phi 3.5 Mini Instruct2024-08128K3.8BNoNoNo
Phi 3.5 MoE Instruct2024-08128K16x3.8B (42B, 6.6B active)NoNoNo
Phi 3.5 Vision Instruct2024-08128K4.1BYesYesNo
Phi-3 Silica2024-063.3BNoNoNo
Phi-3 Medium 128K2024-05128K14BNoNoNo
Phi-3 Medium 4K2024-054K14BNoNoYes
Phi-3 Small 128K2024-05128K7BNoNoNo
Phi-3 Small 8K2024-058K7BNoNoNo
Phi-3 Vision2024-05128K4.2BYesNoNo
DeepInfra Phi 3 Mini 4K Instruct2024-054K3.8BNoNoYes
DeepInfra Phi 3 Small 128K Instruct2024-05128k7BNoNoYes
Phi-3 Mini 128K2024-04128K3.8BNoNoNo
Phi-3 Mini 4k2024-044K3.8BNoNoNo
Phi-3 Mini2024-044K3.8BNoNoNo
Phi-3 Medium2024-044K14BNoNoNo

Available From(6 providers)

Pricing

Phi-3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
DeepInfra Phi 3 Mini 4K InstructDeepInfra$0.05$0.15Serverless
Phi-3 Mini 128KReplicate API$0.05$0.25Serverless
Phi-3 Mini 4kReplicate API$0.05$0.25Serverless
Phi-3 Mini 128KFireworks AI$0.1$0.1Provisioned
Phi-3 Medium 4KDeepInfra$0.14$0.41Serverless
Phi-3 VisionFireworks AI$0.2$0.2Serverless
Phi-3 Mini 4kMicrosoft Foundry$0.28$0.84Serverless
Phi-3 VisionMicrosoft Foundry$0.28$0.84Provisioned
Phi-3 Mini 128KMicrosoft Foundry$0.3$0.9Serverless
Phi-3 Small 8KMicrosoft Foundry$0.32$0.96Serverless
Phi-3 Small 128KMicrosoft Foundry$0.35$1.05Serverless
Phi-3 Medium 4KMicrosoft Foundry$0.45$1.35Serverless
DeepInfra Phi 3 Small 128K InstructDeepInfra$0.45$0.65Serverless
Phi-3 Medium 128KMicrosoft Foundry$0.5$1.5Serverless
Phi 3.5 MoE InstructFireworks AI$0.5$0.5Serverless
Phi 3.5 Mini InstructFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is Phi-3 used for?
Phi-3 is used for vision and multimodal work, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Phi-3 compare to Harrier?
Phi-3 by Microsoft Research is strongest where you need vision and multimodal work, while Harrier by Microsoft Research is the closest related family to check for embedding. Phi-3 has 15 listed variants and reaches up to 128K context, while Harrier reaches up to 33K context, so compare the specs and pricing tables before choosing a production model.
Which Phi-3 model should I use?
For the lowest listed input price, start with DeepInfra Phi 3 Mini 4K Instruct through DeepInfra at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Phi 3.5 Vision Instruct with 128K context and multimodal inputs.

Models(15)