What is Phi-1 used for?

Phi-1 is used for coding, math-heavy prompts, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.

How does Phi-1 compare to Harrier?

Phi-1 by Microsoft Research is strongest for coding workloads, while Harrier, also by Microsoft Research, is the closest related family to consider for embeddings. Phi-1 has 2 listed variants with up to 2k context, while Harrier reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.

Which Phi-1 model should I use?

Phi-1.5 is both the lowest listed input-price option at $0.07/1M input tokens through Microsoft Foundry and the strongest local starting point with 2k context. Use the provider table if latency, deployment type, or output-token pricing matters more than input price.

Phi-1 Models by Microsoft Research

Microsoft Research | MIT | Open Source

2 models | 2023 | Up to 2k context | From $0.07/1M input tokens

About

The Phi-1 family of large language models (LLMs), developed by Microsoft, focuses on code generation and reasoning. Phi-1, the initial model in the family, is a transformer-based model with 1.3 billion parameters that specializes in basic Python coding. It was trained on a blend of "textbook quality" data sourced from the web and synthetic data generated with GPT-3.5. Despite its small size relative to other LLMs, Phi-1 exceeds 50% accuracy on the HumanEval benchmark for simple Python coding tasks. Subsequent models in the Phi family, such as Phi-1.5 and Phi-2, build on this foundation, extending to broader natural language tasks while keeping the focus on efficiency and high-quality data. Together, these models reflect Microsoft's research into smaller, more efficient LLMs that rival the performance of much larger models.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Current Phi-1 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Phi-1.5	Use when the workload needs 2k context and 1.3B parameters.	2023-09	2k context, 1.3B parameters	Current
Phi-1	Use when the workload needs 2k context and 1.3B parameters.	2023-06	2k context, 1.3B parameters	Current

Release Timeline

2 release groups

2023-09 (1 current)
Phi-1.5: 2k context, 1.3B parameters, Current

2023-06 (1 current)
Phi-1: 2k context, 1.3B parameters, Current

Specifications (2 models)

Phi-1 model specifications comparison
Model	Released	Context	Parameters
Phi-1.5	2023-09	2k	1.3B
Phi-1	2023-06	2k	1.3B
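
Because both variants list a 2k context, the prompt and the generated tokens have to share that budget. The following is a minimal sketch of a budget check, assuming "2k" means 2,048 tokens and that token counts come from whatever tokenizer you use. The names CONTEXT_TOKENS, max_new_tokens, and fits_context are hypothetical helpers for illustration, not part of any Phi-1 tooling.

```python
# Assumption: the "2k" context listed in the specifications table is 2,048 tokens.
CONTEXT_TOKENS = 2048


def max_new_tokens(prompt_tokens: int, context_tokens: int = CONTEXT_TOKENS) -> int:
    """Return how many tokens can still be generated after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt longer than the window leaves no room for generation.
    return max(context_tokens - prompt_tokens, 0)


def fits_context(
    prompt_tokens: int,
    requested_new_tokens: int,
    context_tokens: int = CONTEXT_TOKENS,
) -> bool:
    """Check whether prompt plus requested output fits in the context window."""
    return prompt_tokens + requested_new_tokens <= context_tokens


if __name__ == "__main__":
    prompt = 1500  # hypothetical prompt length in tokens
    print("room left for output:", max_new_tokens(prompt))
    print("fits with 600 new tokens:", fits_context(prompt, 600))
```

A check like this is most useful for code-completion prompts, where long files can consume most of the window before any output is generated.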

Available From (1 provider)

Microsoft Foundry

Pricing

Phi-1 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Phi-1.5	Microsoft Foundry	$0.07	$0.07	Provisioned
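
To turn the listed rates into a rough budget, the sketch below multiplies token counts by the per-million prices from the table. It assumes the $0.07 rates scale linearly with token volume, which may not match how a Provisioned deployment is actually billed, so treat the result as an estimate only. The function name estimate_cost_usd and the example token counts are hypothetical.

```python
# Rates copied from the pricing table (USD per 1M tokens, Phi-1.5 via Microsoft Foundry).
INPUT_USD_PER_MILLION = 0.07
OUTPUT_USD_PER_MILLION = 0.07


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_rate: float = INPUT_USD_PER_MILLION,
    output_rate: float = OUTPUT_USD_PER_MILLION,
) -> float:
    """Estimate cost in USD, assuming simple linear per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * input_rate
    output_cost = output_tokens / 1_000_000 * output_rate
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical monthly volume: 10M input tokens and 2M output tokens.
    cost = estimate_cost_usd(10_000_000, 2_000_000)
    print(f"estimated cost: ${cost:.2f}")
```

With the example volume of 10M input and 2M output tokens, the estimate comes to about $0.84.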
