LLM Reference

Phi 3.5 Mini Instruct

Released
2024-08-20
Last refreshed
2026-05-19
Status
Researched 16d ago
Open SourceLong context

Phi 3.5 Mini Instruct is worth evaluating for long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating long context
  • Workloads that can use a 128k context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
Phi-3
Released
2024-08-20
Context
128k
Parameters
3.8B
Architecture
Decoder Only
Knowledge cutoff
2023-10
Specialization
general
License
MIT
Training
finetuned
Created by

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States
Founded 1991
Website
Pricing
Output / 1M
$0.900
Input / 1M
$0.900

Cheapest of 2 routes · Fireworks AI

About

Phi 3.5 Mini Instruct is Microsoft Research's Phi-3 model. It offers a 128K-token context window with weights openly available for self-hosting.

Phi 3.5 Mini Instruct is an open-source model in the Phi-3 family. The structured metadata tracks a 128k-token context window. This page tracks provider routes through Fireworks AI and NVIDIA NIM, with the cheapest tracked route listed at $0.9 input and $0.9 output per 1M tokens. No headline benchmark score is tracked for Phi 3.5 Mini Instruct yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.900$0.900
Serverless
NVIDIA NIM--
ServerlessPartial

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(8)

Comparison and alternatives

Browse all comparisons →