LLM Reference

Phi 4 Multimodal Instruct

Multimodal

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution

Providers (2)

Provider      Input (per 1M)  Output (per 1M)  Type
Fireworks AI  $0.9            $0.9             Serverless
NVIDIA NIM                                     Serverless
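
The per-1M-token prices in the provider table can be turned into a rough per-request estimate. The sketch below assumes the Fireworks AI rates listed above ($0.9 input, $0.9 output, in USD) and that billing is linear in token count. The function name `estimate_cost` and the token counts are hypothetical and chosen only for illustration. No rate is listed for NVIDIA NIM, so it is not covered here.

```python
# Rates copied from the provider table (USD per 1M tokens, Fireworks AI).
INPUT_RATE_PER_M = 0.9
OUTPUT_RATE_PER_M = 0.9


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    total = input_tokens * INPUT_RATE_PER_M + output_tokens * OUTPUT_RATE_PER_M
    return total / 1_000_000


# Hypothetical request: 100,000 input tokens and 2,000 output tokens.
cost = estimate_cost(100_000, 2_000)
print(f"${cost:.4f}")
```

Actual charges may differ, since providers can round, set minimums, or count image and audio inputs differently from text tokens.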

Specifications

Family: Phi-4
Released: 2025-01-01
Context: 128K
Architecture: Decoder Only
Specialization: multimodal
Training: pretraining
Fine-tuning: instruct