LLM ReferenceLLM Reference
NVIDIA NIM

Phi-3 Medium 4K on NVIDIA NIM

Phi-3 · Microsoft Research

ProvisionedOpen Source

Last refreshed 2026-05-01. Next refresh: weekly.

Why use Phi-3 Medium 4K on NVIDIA NIM?

NVIDIA NIM offers Phi-3 Medium 4K with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Compare Phi-3 Medium 4K across 3 providers to find the best fit for your use case
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: phi-3-medium-4k
Model ID
phi-3-medium-4k

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID phi-3-medium-4k.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare Phi-3 Medium 4K Across Providers

ProviderInput (per 1M)Output (per 1M)
Microsoft Foundry$0.45$1.35
NVIDIA NIM
DeepInfra$0.14$0.41

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

Structured Outputs

About Phi-3 Medium 4K

The Phi-3 Medium 4K, developed by Microsoft, is a state-of-the-art large language model with 14 billion parameters. It is engineered for efficiency across various tasks, particularly excelling in reasoning capabilities. This model is designed to handle 4,096 token context lengths, allowing for the processing of longer input sequences. Leveraging a dense, decoder-only Transformer architecture, it incorporates techniques like supervised fine-tuning and direct preference optimization to align with human preferences and safety standards. The model supports multilingual data, although it is primarily trained in English. Its lightweight nature allows for deployment on diverse hardware platforms, making it accessible and versatile for both commercial and research purposes. Safety measures are embedded, although further precautions are advised for applications with higher risks.

FAQ

What is the context window for Phi-3 Medium 4K on NVIDIA NIM?

Phi-3 Medium 4K supports a 4,000 token context window on NVIDIA NIM.

How does NVIDIA NIM compare to other Phi-3 Medium 4K providers?

Phi-3 Medium 4K is available from 3 providers. The cheapest input pricing is $0.14/1M tokens from DeepInfra.

Who created Phi-3 Medium 4K?

Phi-3 Medium 4K was created by Microsoft Research as part of the Phi-3 model family.

Is Phi-3 Medium 4K open source?

Phi-3 Medium 4K is open source according to the seed data.

Get Started

Model Specs

Released2024-05-21
Parameters14B
Context4K
ArchitectureDecoder Only