Azure OpenAI Models — Pricing & Benchmarks
18 models available · Microsoft
Azure OpenAI hosts 18 AI models in this catalog. The lowest listed input price is babbage at $0.4/1M input tokens. LLM Reference lets you compare these models across all 63 providers without switching tabs.
| Model | Input (per 1M) | Output (per 1M) | Context | |
|---|---|---|---|---|
| babbage | $0.40 | $0.40 | 2K | |
| GPT-3.5 Turbo | $0.5 | $1.5 | 16K | |
| GPT-3.5 Turbo 16k | $0.5 | $2.00 | 16K | |
| GPT-3.5 Turbo (Instruct) | $1.5 | $2 | 4K | |
| davinci | $2.00 | $2.00 | 2K | |
| GPT-4o (05-13) | $5 | $15 | 128K | |
| GPT-4 Turbo | $10 | $30 | 128K | |
| GPT-4 Turbo Preview | $10 | $30 | 128K | |
| GPT-4 Vision Preview | $10 | $40.00 | 128K | |
| GPT-4 | $30 | $60 | 8K | |
| GPT-2 | — | — | 1K | |
| GPT-2 Large | — | — | 1K | |
| GPT-2 Medium | — | — | 1K | |
| GPT-4.1 | — | — | 1M | |
| GPT-4.5 | — | — | 8K | |
| GPT-4o | — | — | 128K | |
| GPT-4o-mini | — | — | 128K | |
| o3 Mini | — | — | 200K |
Pricing Overview
About Azure OpenAI
Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Deployments run in customer-selected regions with private networking, role-based access control, and capacity options spanning Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds an evaluation, monitoring, and Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.
Full provider profile →