LLM ReferenceLLM Reference
Microsoft Foundry

Microsoft Foundry

HyperscalerTier 1

Microsoft

Hyperscaler

Platform Overview

Microsoft Foundry offers a comprehensive platform-as-a-service for enterprise AI operations. It provides multiple deployment options including Serverless APIs (pay-as-you-go), Global Standard (shared managed capacity), Provisioned Throughput Units (reserved capacity), batch processing, and bring-your-own model deployments. The platform features a unified control plane for models, agents, tools, and observability. Its Agent Service enables building and deploying AI agents with built-in tracing, monitoring, and governance. Evaluation and monitoring tools assess model performance, safety, and groundedness. Foundry supports seamless upgrades from Azure OpenAI with non-destructive migration, maintaining existing deployments while unlocking multi-provider model access and advanced platform capabilities.

Available Models(80)

View all →
ModelInput (per 1M)Output (per 1M)Type
Claude Opus 4.7$5$25
Serverless
MAI-Transcribe-1$0.36
Serverless
MAI-Voice-1$22.00
Serverless
MAI-Image-2$5.00$33.00
Serverless
DeepSeek V3.1
ServerlessProvisioned
Grok 4 Fast Non-Reasoning
ServerlessProvisioned
Grok 4 Fast Reasoning
ServerlessProvisioned
Nemotron 3 8B$0.37$1.10
Provisioned
Claude Opus 4.5
ServerlessProvisioned
Claude Haiku 4.5
ServerlessProvisioned
View full catalog →

Platform Details

TypeHyperscaler
TierTier 1
Models80

Organization

Microsoft
Founded2025
Redmond, Washington, United States

Microsoft Foundry is a unified enterprise AI platform that significantly expands beyond Azure OpenAI. It functions as a multi-provider hosting and deployment platform for LLMs, supporting models from OpenAI, Anthropic, DeepSeek, xAI, Meta, Mistral, NVIDIA, and others. Foundry integrates agent services, evaluation, observability, and governance into a single Azure control plane. Key capabilities include a multi-provider model catalog, Model Router for intelligent prompt routing, Foundry Agent Service for building and deploying AI agents with built-in tracing and monitoring, and enterprise-grade governance with RBAC, compliance, and regional deployments. For broader model catalog including Claude, DeepSeek, Grok, Llama, Mistral, and NVIDIA Nemotron, Foundry is the recommended platform over Azure OpenAI.