LLM Reference

Phi-3 Medium 4K

About

The Phi-3 Medium 4K, developed by Microsoft, is a state-of-the-art large language model with 14 billion parameters. It is engineered for efficiency across various tasks, particularly excelling in reasoning capabilities. This model is designed to handle 4,096 token context lengths, allowing for the processing of longer input sequences. Leveraging a dense, decoder-only Transformer architecture, it incorporates techniques like supervised fine-tuning and direct preference optimization to align with human preferences and safety standards. The model supports multilingual data, although it is primarily trained in English. Its lightweight nature allows for deployment on diverse hardware platforms, making it accessible and versatile for both commercial and research purposes. Safety measures are embedded, although further precautions are advised for applications with higher risks.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

Providers(3)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
Azure OpenAI$0.45$1.35ServerlessProvisioned
NVIDIA NIMProvisioned
DeepInfra$0.14$0.41Serverless

Benchmark Scores(2)

BenchmarkScoreVersionSource
HumanEval52.7pass@1Open LLM Leaderboard
Massive Multitask Language Understanding68.95-shotOpen LLM Leaderboard

Rankings

Specifications

FamilyPhi-3
Released2024-05-21
Parameters14B
Context4K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States
Founded 1991
Website