LLM Reference

Phi-1 Models by Microsoft Research

Microsoft Research · MIT · Open Source
2 models · 2023 · Up to 2k ctx · From $0.07/1M input

About

The Phi-1 family of large language models (LLMs), developed by Microsoft Research, focuses primarily on code generation and reasoning. Phi-1, the first model in the family, is a transformer-based model with 1.3 billion parameters that specializes in basic Python coding. It was trained on a blend of "textbook quality" data sourced from the web and synthetic data generated with GPT-3.5. Despite its small size relative to other LLMs, Phi-1 exceeds 50% accuracy on the HumanEval benchmark for simple Python coding tasks. Later models in the Phi line, such as Phi-1.5 and Phi-2, build on this foundation and extend it to broader natural language tasks while keeping the focus on efficiency and high-quality data. These models reflect Microsoft's research into smaller, more efficient LLMs that rival the performance of much larger models.
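HumanEval scores such as the one quoted above are commonly reported as pass@k, the probability that at least one of k sampled completions passes a problem's unit tests. The page does not say which k the figure refers to, so the following is only a sketch of the widely used unbiased pass@k estimator, assuming n samples were drawn for a problem and c of them were correct. The function name and arguments are illustrative, not taken from the source.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: total completions sampled for the problem (assumed input)
    c: number of those completions that passed the tests
    k: number of attempts the metric allows
    """
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and n")
    # If fewer than k samples are wrong, any k-subset contains a correct one.
    if n - c < k:
        return 1.0
    # 1 minus the probability that all k drawn samples are incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)
```

A benchmark-level score is then the mean of this value over all problems. With k = 1 the estimator reduces to the fraction of correct samples, c / n.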

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Phi-1.5 (Current)

Use when the workload needs 2k context and 1.3B parameters.

2023-09 · 2k context · 1.3B parameters

Phi-1 (Current)

Use when the workload needs 2k context and 1.3B parameters.

2023-06 · 2k context · 1.3B parameters

Release Timeline

2 release groups

2023-09 (1 current)
Phi-1.5 · 2k context · 1.3B parameters · Current

2023-06 (1 current)
Phi-1 · 2k context · 1.3B parameters · Current

Specifications (2 models)

Phi-1 model specifications comparison
Model | Released | Context | Parameters
Phi-1.5 | 2023-09 | 2k | 1.3B
Phi-1 | 2023-06 | 2k | 1.3B
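Because both variants share a 2k context, a request only fits if the prompt plus the requested completion stays within that window. The sketch below checks that budget. It assumes "2k" means 2,048 tokens and that token counts come from whatever tokenizer the deployment uses; the helper name and the constant are hypothetical.

```python
# Assumption: the "2k" context in the specifications table means 2,048 tokens.
CONTEXT_WINDOW = 2048


def fits_context(prompt_tokens: int, max_new_tokens: int,
                 context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if prompt plus completion fits in the context window."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_new_tokens <= context_window


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the completion (never negative)."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(0, context_window - prompt_tokens)
```

For example, a 1,500-token prompt leaves at most 548 tokens for the completion under this assumption.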

Available From (1 provider)

Pricing

Phi-1 model pricing by provider
Model | Provider | Input / 1M | Output / 1M | Type
Phi-1.5 | Microsoft Foundry | $0.07 | $0.07 | Provisioned
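The listed rates can be turned into a rough per-request estimate. The sketch below assumes cost scales linearly with token counts at the table's $0.07 per 1M input and output tokens. The listing is marked "Provisioned", so actual billing may work differently; treat the helper (a hypothetical name) as an approximation only.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD per 1M tokens, Phi-1.5).
INPUT_PRICE_PER_M = Decimal("0.07")
OUTPUT_PRICE_PER_M = Decimal("0.07")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Linear per-token cost estimate in USD (assumed billing model)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M
```

Under these assumptions, a request with 1,500 input tokens and 500 output tokens comes to 2,000 × $0.07 / 1,000,000 = $0.00014.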

Frequently Asked Questions

What is Phi-1 used for?
Phi-1 is used for coding, math-heavy prompts, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.

How does Phi-1 compare to Harrier?
Phi-1 is strongest for coding workloads, while Harrier, also by Microsoft Research, is the closest related family to consider for embedding. Phi-1 has 2 listed variants and reaches up to 2k context, while Harrier reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.

Which Phi-1 model should I use?
For the lowest listed input price, start with Phi-1.5 through Microsoft Foundry at $0.07/1M input tokens. For the latest and most capable model to run locally, evaluate Phi-1.5 with 2k context.

Models (2)