Which Microsoft Foundry model is cheapest?

The cheapest Microsoft Foundry model in this catalog is Mistral Ministral 3B at $0.04/1M input tokens.

What is the context window for Microsoft Foundry models?

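Context windows for the Microsoft Foundry models listed here range from 512 tokens (Prompt Guard 86M and the Cohere Embed v3.0 models) to 10M tokens (Llama 4 Scout 17B-16E Instruct). Some entries list no context size.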
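The Context column in the table below uses shorthand labels such as 512, 4k, and 10m. To sort or filter on them, a minimal sketch like the following could expand each label to a token count. The function name `parse_context` is hypothetical, and it assumes "k" and "m" mean 1,000 and 1,000,000; since the catalog values look rounded, the results are approximate.

```python
from typing import Optional

# Assumption: "k" and "m" are decimal multipliers, and catalog values are rounded.
_SUFFIXES = {"k": 1_000, "m": 1_000_000}

# The table marks a missing value with this dash character.
_MISSING = "\u2014"


def parse_context(label: str) -> Optional[int]:
    """Convert a context label such as "512", "4k", or "10m" to a token count.

    Returns None when the catalog lists no context size.
    """
    text = label.strip().lower()
    if text in {"", _MISSING}:
        return None
    multiplier = _SUFFIXES.get(text[-1])
    if multiplier is None:
        # Plain integer label, for example "512".
        return int(text)
    return int(float(text[:-1]) * multiplier)


if __name__ == "__main__":
    for label in ["512", "4k", "128k", "10m", _MISSING]:
        print(repr(label), parse_context(label))
```

Returning None for missing values, rather than 0, keeps unlisted context sizes from being mistaken for very small ones when sorting.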

How does Microsoft Foundry compare to AWS Bedrock?

Microsoft Foundry lists 135 models here, while AWS Bedrock lists 130. Compare pricing availability, context windows, and benchmark coverage before choosing a host.
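One way to apply those criteria to a single host is to drop entries with no listed price or too small a context window, then sort what remains by input price. The sketch below does this for a handful of rows copied from this catalog. The `ModelRow` type and `shortlist` function are hypothetical names, and the context values are approximate expansions of the table's shorthand (for example, 33k becomes 33,000).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ModelRow:
    name: str
    input_price: Optional[float]   # USD per 1M input tokens; None if not listed
    context_tokens: Optional[int]  # approximate; None if not listed


# A few rows copied from this catalog; context values are approximate.
ROWS = [
    ModelRow("Mistral Ministral 3B", 0.04, None),
    ModelRow("Mistral Small 2503", 0.10, 33_000),
    ModelRow("Phi 4 Reasoning Plus", 0.125, 128_000),
    ModelRow("Llama 4 Scout 17B-16E Instruct", 0.20, 10_000_000),
    ModelRow("DeepSeek R1", None, 128_000),
]


def shortlist(rows: List[ModelRow], min_context: int) -> List[ModelRow]:
    """Keep rows with a listed price and enough context, cheapest first."""
    eligible = [
        row for row in rows
        if row.input_price is not None
        and row.context_tokens is not None
        and row.context_tokens >= min_context
    ]
    return sorted(eligible, key=lambda row: row.input_price)


if __name__ == "__main__":
    for row in shortlist(ROWS, min_context=100_000):
        print(row.name, row.input_price)
```

With a 100,000-token minimum, this keeps Phi 4 Reasoning Plus and Llama 4 Scout 17B-16E Instruct. It drops DeepSeek R1 because no price is listed, and Mistral Ministral 3B because no context size is listed.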

Microsoft Foundry Models — Pricing & Benchmarks

135 models available · Microsoft

Microsoft Foundry hosts 135 AI models in this catalog. The lowest listed input price is Mistral Ministral 3B at $0.04/1M input tokens. LLM Reference lets you compare these models across all 80 providers without switching tabs.

Model	Input (per 1M tokens)	Output (per 1M tokens)	Context (tokens)
Mistral Ministral 3B	$0.04	$0.04	—
Prompt Guard 86M	$0.05	$0.05	512
DeciCoder 1B	$0.07	$0.07	4k
Dolly 2.0 12B	$0.07	$0.07	—
Phi-1.5	$0.07	$0.07	2k
Phi-2	$0.07	$0.07	2k
Qwen2-1.5B	$0.07	$0.07	—
Cohere Embed English v3.0	$0.10	—	512
Cohere Embed Multilingual v3.0	$0.10	—	512
Mistral Small 2503	$0.10	$0.30	33k
Cohere Embed v4.0	$0.12	—	128k
Phi 4 Reasoning Plus	$0.125	$0.50	128k
Mistral 7B v0.1	$0.14	$0.14	8k
Qwen2-7B	$0.15	$0.15	128k
Llama 4 Scout 17B-16E Instruct	$0.20	$0.78	10m
Grok 3 Mini	$0.25	$1.27	131k
Mixtral 8x7B	$0.27	$0.27	32k
Phi-3 Mini 4k	$0.28	$0.84	4k
Phi-3 Vision	$0.28	$0.84	128k
Codestral 2501	$0.30	$0.90	262k
Llama 3.1 8B Instruct	$0.30	$0.61	128k
Mistral NeMo Instruct (2407)	$0.30	$0.30	128k
Phi-3 Mini 128K	$0.30	$0.90	128k
Phi-3 Small 8K	$0.32	$0.96	8k
Llama 4 Maverick 17B Instruct FP8	$0.35	$1.41	1m
Phi-3 Small 128K	$0.35	$1.05	128k
MAI-Transcribe-1	$0.36	—	—
Dolphin 2.9 Llama 3 8B	$0.37	$1.10	8k
Hermes 2 Pro Llama 3 8B	$0.37	$1.10	8k
Llama 3 8B Gradient 262K	$0.37	$1.10	262k
Llama 3 8B Instruct	$0.37	$1.10	8k
Llama 3.2 11B Vision Instruct	$0.37	$0.37	128k
Llama Guard 3 8B	$0.37	$1.10	8k
Nous Llama 3 8B	$0.37	$1.10	8k
NVIDIA Llama 3 ChatQA 8B	$0.37	$1.10	8k
Mistral Medium 2505	$0.40	$2.00	128k
Phi-3 Medium 4K	$0.45	$1.35	4k
Command R	$0.50	$1.50	128k
Jamba-Instruct	$0.50	$0.70	256k
Phi-3 Medium 128K	$0.50	$1.50	128k
CodeLlama 7B	$0.52	$0.67	100k
CodeLlama 7B Python	$0.52	$0.67	100k
DeciLM 7B	$0.52	$0.67	8k
Falcon 7B	$0.52	$0.67	—
Llama 2 7B Chat	$0.52	$0.67	4k
Orca 2 7B	$0.52	$0.67	4k
SOLAR 10.7B	$0.52	$0.67	4k
Llama 3.3 70B Instruct (free)	$0.71	$0.71	66k
MAI-Code-1-Flash	$0.75	$4.50	256k
CodeLlama 13B	$0.81	$0.94	100k
CodeLlama 13B Python	$0.81	$0.94	100k
Fugaku-LLM 13B	$0.81	$0.94	4k
Llama 2 13B Chat	$0.81	$0.94	4k
Orca 2 13B	$0.81	$0.94	4k
WizardLM 13B V1.1	$0.81	$0.94	2k
Claude Haiku 4.5	$1.00	$5.00	200k
Mistral Small	$1.00	$3.00	32k
Qwen2-72B	$1.00	$2.00	128k
Smaug 72B	$1.00	$2.00	32k
Command R 08-2024	$1.50	$2.00	131k
Qwen1.5-110B	$1.50	$2.50	32k
CodeLlama 34B	$1.54	$1.77	100k
CodeLlama 34B Python	$1.54	$1.77	100k
Falcon 40B	$1.54	$1.77	—
Llama 2 70B Chat	$1.54	$1.77	4k
Arctic	$2.00	$2.00	4k
Mixtral 8x22B v0.1	$2.00	$6.00	64k
Llama 3.2 90B Vision Instruct	$2.04	$2.04	128k
Command A (03-2025)	$2.50	$10.00	256k
Command R+ 08-2024	$2.50	$10.00	131k
Llama 3.1 70B Instruct	$2.68	$3.54	128k
DBRX Instruct	$2.70	$2.70	32k
Claude Sonnet 4.5	$3.00	$15.00	200k
Claude Sonnet 4.6	$3.00	$15.00	1m
Command R+	$3.00	$15.00	128k
Grok-3	$3.00	$15.00	131k
Mistral Large 2 (2407)	$3.00	$9.00	128k
Jais 30B	$3.20	$9.71	2k
CodeLlama 70B	$3.78	$11.34	16k
CodeLlama 70B Python	$3.78	$11.34	16k
Llama 3 70B Instruct	$3.78	$11.34	8k
Llama 3 TenyxChat 70B	$3.78	$11.34	—
NVIDIA Llama 3 ChatQA 70B	$3.78	$11.34	8k
Mistral Large	$4.00	$12.00	32k
Claude Opus 4.5	$5.00	$25.00	200k
Claude Opus 4.6	$5.00	$25.00	1m
Claude Opus 4.7	$5.00	$25.00	1m
MAI-Image-2	$5.00	$33.00	—
Llama 3.1 405B Instruct	$5.33	$16.00	128k
Claude Fable 5	$10.00	$50.00	1m
Claude Opus 4.1	$15.00	$75.00	200k
MAI-Voice-1	$22.00	—	—
Bria 2.3 Fast	—	—	—
Claude 3.5 Sonnet	—	—	200k
Claude Mythos Preview	—	—	1m
Claude Opus 4.8	—	—	1m
Claude Sonnet 5	—	—	1m
Cohere Rerank v3.5	—	—	4k
Cohere Rerank v4.0 Fast	—	—	32k
Cohere Rerank v4.0 Pro	—	—	32k
DeepSeek R1	—	—	128k
DeepSeek R1 0528	—	—	130k
DeepSeek V3	—	—	64k
DeepSeek V3 0324	—	—	160k
DeepSeek V3.1	—	—	64k
DeepSeek V3.2	—	—	160k
DeepSeek V3.2 Speciale	—	—	164k
DeepSeek V4 Flash	—	—	1m
FLUX.1.1 [pro]	—	—	—
Grok 4	—	—	256k
Grok 4 Fast Non-Reasoning	—	—	2m
Grok 4 Fast Reasoning	—	—	2m
Grok 4.3	—	—	1m
Grok Code Fast 1	—	—	262k
Kimi K2.5	—	—	256k
Kimi K2.6	—	—	262k
MAI-Code-1	—	—	—
MAI-Image-2.5	—	—	32k
MAI-Image-2.5-Flash	—	—	32k
MAI-Image-2e	—	—	33k
MAI-Thinking-1	—	—	256k
MAI-Transcribe-1.5	—	—	—
MAI-Voice-2	—	—	—
Mistral Document AI 2505	—	—	—
Mistral Document AI 2512	—	—	—
Mistral Large 3 675B Instruct	—	—	128k
Phi 4 Multimodal Instruct	—	—	128k
Phi 4 Reasoning	—	—	128k
Phi-4 14B	—	—	16k
Rerank English V3	—	—	4k
Rerank Multilingual V3	—	—	4k
Stable Diffusion 3.5 Large	—	—	—
Stable Image Core	—	—	—
Stable Image Ultra	—	—	—
TimeGEN-1	—	—	—
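Because prices are quoted per 1M tokens, the cost of a workload is its token count divided by 1,000,000 and multiplied by the listed rate, summed over input and output. The sketch below works this out for two rows from the table. The function name `estimate_cost` and the workload size (2M input tokens, 500k output tokens) are illustrative assumptions. It also assumes every price is a per-token rate, which may not hold for the voice, transcription, and image entries.

```python
from typing import Optional


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1m: float,
    output_price_per_1m: Optional[float] = None,
) -> float:
    """Estimate cost in USD from per-1M-token list prices.

    Pass output_price_per_1m=None for rows with no output price,
    such as the embedding models.
    """
    cost = input_tokens / 1_000_000 * input_price_per_1m
    if output_price_per_1m is not None:
        cost += output_tokens / 1_000_000 * output_price_per_1m
    return cost


# Hypothetical workload: 2M input tokens and 500k output tokens.
# Mistral Small 2503 row: $0.10 input, $0.30 output per 1M tokens.
mistral_small = estimate_cost(2_000_000, 500_000, 0.10, 0.30)
# Claude Haiku 4.5 row: $1.00 input, $5.00 output per 1M tokens.
claude_haiku = estimate_cost(2_000_000, 500_000, 1.00, 5.00)

print(f"Mistral Small 2503: ${mistral_small:.2f}")
print(f"Claude Haiku 4.5:   ${claude_haiku:.2f}")
```

For this workload the estimates are about $0.35 and $4.50. Input and output rates often differ several-fold, so output-heavy workloads can reorder a ranking based on input price alone.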

Where else to run this

Mistral Large 3 675B Instruct on Microsoft Foundry: provider setup and pricing
Llama Guard 3 8B on Microsoft Foundry: provider setup and pricing
Llama 2 7B Chat on Microsoft Foundry: provider setup and pricing
Mistral Large 3 675B Instruct on OpenRouter: alternative host
Llama Guard 3 8B on Cloudflare Workers AI: alternative host
Llama 2 7B Chat on Alibaba Cloud PAI-EAS: alternative host

Pricing Overview

Cheapest: $0.04/1M
Most expensive: $22.00/1M

About Microsoft Foundry

Microsoft Foundry is a platform-as-a-service for enterprise AI operations. Deployment options include Serverless APIs (pay-as-you-go), Global Standard (shared managed capacity), Provisioned Throughput Units (reserved capacity), batch processing, and bring-your-own-model deployments. A unified control plane covers models, agents, tools, and observability. The Agent Service supports building and deploying AI agents with built-in tracing, monitoring, and governance, and the evaluation and monitoring tools assess model performance, safety, and groundedness. Existing Azure OpenAI resources can be upgraded to Foundry through a non-destructive migration: current deployments are maintained while gaining multi-provider model access and the platform's other capabilities.
