LLM ReferenceLLM Reference
Azure OpenAI

Azure OpenAI Models — Pricing & Benchmarks

18 models available · Microsoft

Azure OpenAI hosts 18 AI models in this catalog. The lowest listed input price is babbage at $0.4/1M input tokens. LLM Reference lets you compare these models across all 63 providers without switching tabs.

ModelInput (per 1M)Output (per 1M)Context
babbage$0.40$0.402K
GPT-3.5 Turbo$0.5$1.516K
GPT-3.5 Turbo 16k$0.5$2.0016K
GPT-3.5 Turbo (Instruct)$1.5$24K
davinci$2.00$2.002K
GPT-4o (05-13)$5$15128K
GPT-4 Turbo$10$30128K
GPT-4 Turbo Preview$10$30128K
GPT-4 Vision Preview$10$40.00128K
GPT-4$30$608K
GPT-21K
GPT-2 Large1K
GPT-2 Medium1K
GPT-4.11M
GPT-4.58K
GPT-4o128K
GPT-4o-mini128K
o3 Mini200K

Pricing Overview

Cheapest$0.40/1M
Most expensive$30.00/1M

About Azure OpenAI

Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Deployments run in customer-selected regions with private networking, role-based access control, and capacity options spanning Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds an evaluation, monitoring, and Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.

Full provider profile →