How many Azure OpenAI models does LLMReference track?

LLMReference currently tracks 18 models available through Azure OpenAI's API. Azure OpenAI's full catalog may be larger.

What are Azure OpenAI's most popular models?

Azure OpenAI's top models include GPT-4o (05-13), GPT-3.5 Turbo (Instruct), GPT-3.5 Turbo, GPT-4 Turbo, and GPT-4.

What is Azure OpenAI's pricing?

Azure OpenAI pricing ranges from $0.40/1M to $30/1M input tokens depending on the model.

Azure OpenAI

Researched 19d ago, Hyperscaler, Tier 1

Microsoft

Coding, RAG, Agents, Long context, Vision, Classification, JSON / Tool use, Highlight, Hyperscaler

Azure OpenAI offers 18 tracked models (10 with output token pricing). This catalog covers coding, RAG, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 7 workload areas across 18 tracked models; last verified 2026-06-01.

Use it for

Teams comparing token and batch pricing across this provider's models
Operators routing coding, RAG, and agents workloads through this API
Batch buyers auditing discount coverage model-by-model

Do not use it for

Final benchmark picks without opening the relevant model detail page

Tracked models

18

Models available through this provider

Priced output routes

10

Models with output token pricing tracked

Cheapest output

$0.400

babbage on this route

Batch-ready models

Models with discounted batch pricing

Latest model release

2025-04-01

445d since newest release

Freshness

2026-06-01

Researched 19d ago

fresh

Routes available via routers & gateways

These routers list Azure OpenAI as a target provider, so they can sit in front of this catalog for fallback, routing, or unified API access.


Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

Passthrough

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

Subscription, Self-host

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

Subscription, Self-host

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSS, Self-host

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

Subscription, Self-host

Information

Type: Hyperscaler

Tier: Tier 1

Models: 18

Company: Microsoft

Founded: 1975

Redmond, Washington, United States

Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs. Microsoft's broader AI platform spans Azure Machine Learning, Azure Cognitive Services, and Azure AI Search, plus productivity tools like Microsoft 365 Copilot and developer tooling in Visual Studio and the .NET framework. The platform emphasizes responsible AI, security, and regional compliance.

Links

Website, X / Twitter, LinkedIn, Crunchbase

Catalog freshness

The newest model tracked on this provider was released 2025-04-01 (445d ago).
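The 445-day figure can be reproduced with plain date arithmetic. The sketch below assumes the counter runs from the release date to the day the page was viewed, taken here as the 2026-06-01 verification date plus the 19 days shown under Freshness. That reading is an assumption; the site does not state how it computes the number.

```python
from datetime import date, timedelta

# Dates copied from the page.
newest_release = date(2025, 4, 1)
last_verified = date(2026, 6, 1)

# Assumption: "Researched 19d ago" means the page was viewed
# 19 days after the last verification date.
viewed_on = last_verified + timedelta(days=19)

days_at_verification = (last_verified - newest_release).days
days_at_view = (viewed_on - newest_release).days

print(days_at_verification)  # 426
print(days_at_view)          # 445
```

Under this assumption the gap was 426 days on the verification date, and the displayed 445 days includes the 19 days since.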

Where this host wins

Coding: 2 tracked models with SWE-bench / HumanEval-style scores.
RAG: 3 tracked models with RULER / needle-in-a-haystack retrieval benchmarks.
Agentic: 2 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
Long-context: 3 tracked models with context-token or InfiniteBench-class signal.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Product, Docs, Portal, Pricing

SDKs & libraries

Python, JavaScript, Java, Go, .NET

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Azure OpenAI deployments run in customer-selected regions with private networking and role-based access control. Capacity options span Standard pay-per-token, Provisioned Throughput Units (PTUs) for reserved capacity, Global Standard shared capacity, and Batch processing. Azure OpenAI sits inside the wider Microsoft Foundry / Azure AI Studio control plane, which adds evaluation, monitoring, and an Agent Service layer on top of the base model APIs. For workloads that need non-OpenAI models (Claude, DeepSeek, Grok, Llama, Mistral, NVIDIA Nemotron), Microsoft Foundry is the broader catalog; Azure OpenAI is the OpenAI-specific entry point. The service is API-compatible with the OpenAI SDK in most flows, so teams typically swap base URLs and authentication rather than rewriting calls.
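To make the "swap base URLs and authentication" point concrete, here is a minimal sketch that builds the URL and headers for a chat completions call against either endpoint, without sending anything. The helper name `build_chat_request`, the resource name `contoso`, the deployment name `my-gpt4o`, and the `api-version` value are illustrative assumptions; check the vendor docs for the versions and names that apply to your deployment.

```python
from urllib.parse import urlencode


def build_chat_request(provider, api_key, resource=None, deployment=None,
                       api_version="2024-06-01"):
    """Return (url, headers) for a chat completions call. No network I/O.

    Hypothetical helper for illustration only. The api_version default
    is an assumption and should be confirmed in the vendor docs.
    """
    if provider == "openai":
        # OpenAI: fixed host, bearer token; the model name goes in the JSON body.
        url = "https://api.openai.com/v1/chat/completions"
        headers = {"Authorization": f"Bearer {api_key}"}
    elif provider == "azure":
        if not resource or not deployment:
            raise ValueError("Azure needs a resource name and a deployment name")
        # Azure OpenAI: per-resource host, deployment in the path,
        # API version as a query parameter, key in the "api-key" header.
        base = (f"https://{resource}.openai.azure.com/openai/deployments/"
                f"{deployment}/chat/completions")
        url = f"{base}?{urlencode({'api-version': api_version})}"
        headers = {"api-key": api_key}
    else:
        raise ValueError(f"unknown provider: {provider}")
    headers["Content-Type"] = "application/json"
    return url, headers


azure_url, azure_headers = build_chat_request(
    "azure", "AZURE_KEY", resource="contoso", deployment="my-gpt4o")
openai_url, openai_headers = build_chat_request("openai", "OPENAI_KEY")
print(azure_url)
print(openai_url)
```

The request body (messages, temperature, and so on) is broadly the same in both cases; the main differences in this sketch are the host, the deployment name in the path, and the authentication header.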

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models (18)


Model	Input (per 1M tokens)	Output (per 1M tokens)	Batch input (per 1M tokens)	Batch output (per 1M tokens)	Type
GPT-4.1			$1.00	$4.00	Serverless
GPT-4.5			—	—	Serverless
o3 Mini			$0.15	$0.60	Serverless
GPT-4o-mini			—	—	Serverless
GPT-4o			$1.25	$5.00	Serverless
GPT-4o (05-13)	$5.00	$15.00	—	—	Serverless
GPT-4 Turbo	$10.00	$30.00	—	—	Serverless
GPT-4 Turbo Preview	$10.00	$30.00	—	—	Serverless
GPT-4 Vision Preview	$10.00	$40.00	—	—	Serverless
GPT-3.5 Turbo (Instruct)	$1.50	$2.00	—	—	Serverless
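
As a rough illustration of how the per-1M-token prices translate into a bill, the sketch below estimates the cost of a single workload. The prices are copied from table rows that list both input and output prices; the function name `estimate_cost` and the token counts are hypothetical, and real invoices may differ by region, deployment type, or discounts.

```python
# USD per 1M tokens as (input, output), copied from the table above.
PRICES_PER_1M = {
    "GPT-4o (05-13)": (5.00, 15.00),
    "GPT-4 Turbo": (10.00, 30.00),
    "GPT-3.5 Turbo (Instruct)": (1.50, 2.00),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Return the estimated USD cost for the given token counts."""
    input_price, output_price = PRICES_PER_1M[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 200k input tokens and 50k output tokens.
cost = estimate_cost("GPT-4o (05-13)", 200_000, 50_000)
print(f"${cost:.2f}")  # $1.75
```

Here 200,000 input tokens cost $1.00 and 50,000 output tokens cost $0.75, for $1.75 in total.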
