Azure OpenAI Models — Pricing & Benchmarks
18 models available · Microsoft
Azure OpenAI hosts 18 AI models in this catalog. The lowest listed input price is babbage at $0.4/1M input tokens. LLM Reference lets you compare these models across all 80 providers without switching tabs.
| Model | Input (per 1M) | Output (per 1M) | Context | |
|---|---|---|---|---|
| babbage | $0.4 | $0.4 | 2k | |
| GPT-3.5 Turbo | $0.5 | $1.50 | 16k | |
| GPT-3.5 Turbo 16k | $0.5 | $2.00 | 16k | |
| GPT-3.5 Turbo (Instruct) | $1.50 | $2.00 | 4k | |
| davinci | $2.00 | $2.00 | 2k | |
| GPT-4o (05-13) | $5.00 | $15.00 | 128k | |
| GPT-4 Turbo | $10.00 | $30.00 | 128k | |
| GPT-4 Turbo Preview | $10.00 | $30.00 | 128k | |
| GPT-4 Vision Preview | $10.00 | $40.00 | 128k | |
| GPT-4 | $30.00 | $60.00 | 8k | |
| GPT-2 | — | — | 1k | |
| GPT-2 Large | — | — | 1k | |
| GPT-2 Medium | — | — | 1k | |
| GPT-4.1 | — | — | 1.05m | |
| GPT-4.5 | — | — | 8k | |
| GPT-4o | — | — | 128k | |
| GPT-4o-mini | — | — | 128k | |
| o3 Mini | — | — | 200k |
Where else to run this
Pricing Overview
About Azure OpenAI
Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Deployments run in customer-selected regions with private networking, role-based access control, and capacity options spanning Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds an evaluation, monitoring, and Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.
Full provider profile →