Phi 4 Multimodal Instruct
Multimodal
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution
Providers (2)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| Fireworks AI | $0.90 | $0.90 | Serverless |
| NVIDIA NIM | — | — | Serverless |
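To make the listed pricing concrete, here is a minimal sketch that estimates the cost of a single request at the Fireworks AI rates shown in the table. It assumes the "per 1M" unit refers to tokens and that billing is linear in token count; the function name `estimate_cost_usd` and the constants are hypothetical and not part of any provider SDK.

```python
# Rates copied from the Fireworks AI row of the providers table.
# Assumption: prices are quoted in US dollars per 1,000,000 tokens.
INPUT_USD_PER_MILLION = 0.9
OUTPUT_USD_PER_MILLION = 0.9


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count by its per-million rate, then sum.
    return (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    ) / 1_000_000


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token reply.
    print(f"${estimate_cost_usd(2_000, 500):.5f}")
```

No rate is listed for NVIDIA NIM, so the sketch covers only the Fireworks AI row; actual invoices may differ if a provider rounds or applies minimum charges.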
Specifications
Family: Phi-4
Released: 2025-01-01
Context: 128K
Architecture: Decoder Only
Specialization: multimodal
Training: pretraining
Fine-tuning: instruct