LLM ReferenceLLM Reference
Azure OpenAI

GPT-4o (05-13) on Azure OpenAI

GPT-4o · OpenAI

Serverless

Last refreshed 2026-05-10. Next refresh: weekly.

Why use GPT-4o (05-13) on Azure OpenAI?

Azure OpenAI offers GPT-4o (05-13) with pay-as-you-go pricing at $5.00/1M input tokens. Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs.

Compare GPT-4o (05-13) across 4 providers to find the best fit for your use case
Input / 1M
$5.00
Output / 1M
$15.00
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export AZURE_OPENAI_API_KEY=...
Call
import os
from openai import AzureOpenAI
client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],  # e.g. https://{resource}.openai.azure.com
Model ID
gpt-4o-2024-05-13

Request example

import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],  # e.g. https://{resource}.openai.azure.com
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01"
)
response = client.chat.completions.create(
    model="gpt-4o-2024-05-13",  # your deployment name
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • gpt-4o-2024-05-13 is your Azure deployment name, not the underlying model name. Deployment names are set when you deploy a model in Azure AI Foundry / Azure OpenAI Studio.
  • The examples expect AZURE_OPENAI_API_KEY; rename it only if your application config maps the new variable.

Compare GPT-4o (05-13) Across Providers

ProviderInput (per 1M)Output (per 1M)
Azure OpenAI$5.00$15.00
OpenAI API$2.50$10.00
OpenRouter$5.00$15.00
Replicate API$2.50$10.00

Pricing

TypePrice (per 1M)
Input tokens$5.00
Output tokens$15.00

Capabilities

VisionMultimodalStructured OutputsCode Execution

About GPT-4o (05-13)

GPT-4o, or GPT-4 Omni, is OpenAI's premier multimodal AI model, introduced on May 13, 2024. It is designed to handle and generate content from various input types, including text, audio, and visual data, using a unified model, unlike its predecessors which required separate models for different modalities. This model is notable for its rapid response times averaging 320 milliseconds, which align with human conversational speed, along with being 50% more cost-effective than GPT-4 Turbo. Although it mirrors GPT-4 Turbo in standard benchmark performance, GPT-4o excels in multilingual, audio, and vision tasks. Despite its advancements, it shares the limitation of a knowledge cutoff at October 2023 and is prone to generating incorrect information, like other language models. A more compact and economical version, GPT-4o mini, is also available.

FAQ

What does GPT-4o (05-13) cost on Azure OpenAI?

On Azure OpenAI, GPT-4o (05-13) costs $5 per 1M input tokens and $15 per 1M output tokens.

What is the context window for GPT-4o (05-13) on Azure OpenAI?

GPT-4o (05-13) supports a 128,000 token context window on Azure OpenAI.

How does Azure OpenAI compare to other GPT-4o (05-13) providers?

GPT-4o (05-13) is available from 4 providers. The cheapest input pricing is $2.50/1M tokens from OpenAI API.

Who created GPT-4o (05-13)?

GPT-4o (05-13) was created by OpenAI as part of the GPT-4o model family.

Is GPT-4o (05-13) open source?

GPT-4o (05-13) is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2024-05-13
Parameters1.76T (8x222B MoE)*
Context128K
ArchitectureMixture of Experts
Knowledge cutoff2023-10

Related Models on Azure OpenAI