LLM ReferenceLLM Reference
Azure OpenAI

Azure OpenAI

HyperscalerTier 1

Microsoft

HighlightHyperscaler

Platform Overview

Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Deployments run in customer-selected regions with private networking, role-based access control, and capacity options spanning Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds an evaluation, monitoring, and Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.

SDKs & Libraries

Available Models(18)

View all →
ModelInput (per 1M)Output (per 1M)Batch input (per 1M)Batch output (per 1M)Type
GPT-4.5
Serverless
GPT-4.1$1.00$4.00
Serverless
o3 Mini$0.15$0.60
Serverless
GPT-4o-mini
Serverless
GPT-4o$1.25$5.00
Serverless
GPT-4o (05-13)$5$15
Serverless
GPT-4 Turbo$10$30
Serverless
GPT-4 Turbo Preview$10$30
Serverless
GPT-4 Vision Preview$10$40.00
Serverless
GPT-3.5 Turbo (Instruct)$1.5$2
Serverless
View full catalog →

Platform Details

TypeHyperscaler
TierTier 1
Models18

Organization

Microsoft
Founded1975
Redmond, Washington, United States

Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Microsoft's broader AI platform spans Azure Machine Learning, Azure Cognitive Services, and Azure AI Search, plus productivity tools like Microsoft 365 Copilot and developer tooling in Visual Studio and the .NET framework. The platform emphasizes responsible AI, security, and regional compliance.