LLM ReferenceLLM Reference
Azure OpenAI

GPT-3.5 Turbo 16k on Azure OpenAI

GPT-3.5 · OpenAI

Serverless

Last refreshed 2026-04-27. Next refresh: weekly.

Why use GPT-3.5 Turbo 16k on Azure OpenAI?

Azure OpenAI offers GPT-3.5 Turbo 16k with pay-as-you-go pricing at $0.50/1M input tokens. Azure OpenAI Service hosts OpenAI's GPT-4o, GPT-4, GPT-3.5, and embedding models on Microsoft Azure with enterprise SLAs.

Compare GPT-3.5 Turbo 16k across 3 providers to find the best fit for your use case
Input / 1M
$0.50
Output / 1M
$2.00
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export AZURE_OPENAI_API_KEY=...
Call
import os
from openai import AzureOpenAI
client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],  # e.g. https://{resource}.openai.azure.com
Model ID
gpt-3.5-turbo-16k

Request example

import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],  # e.g. https://{resource}.openai.azure.com
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01"
)
response = client.chat.completions.create(
    model="gpt-3.5-turbo-16k",  # your deployment name
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • gpt-3.5-turbo-16k is your Azure deployment name, not the underlying model name. Deployment names are set when you deploy a model in Azure AI Foundry / Azure OpenAI Studio.
  • The examples expect AZURE_OPENAI_API_KEY; rename it only if your application config maps the new variable.

Compare GPT-3.5 Turbo 16k Across Providers

ProviderInput (per 1M)Output (per 1M)
Azure OpenAI$0.50$2.00
Salesforce Einstein Generative AI
OpenRouter$3.00$4.00

Pricing

TypePrice (per 1M)
Input tokens$0.50
Output tokens$2.00

Capabilities

Structured Outputs

About GPT-3.5 Turbo 16k

GPT-3.5 Turbo 16k, developed by OpenAI, is an advanced language model featuring a significantly enhanced context window of 16,384 tokens—four times larger than its predecessor's 4,096 tokens 245. This extension allows it to process and comprehend extended texts, up to approximately 20 pages, in a single interaction, while maintaining the speed and efficiency of earlier versions 10. Although it is a chat-centric model not compatible with the completions endpoint 8, it remains highly effective for tasks requiring prolonged relevance and coherence through OpenAI's API 46. With a knowledge cutoff in September 2021, it surpasses its predecessors in capability, yet it is less advanced than GPT-4 in visual and multilingual contexts 35.

FAQ

What does GPT-3.5 Turbo 16k cost on Azure OpenAI?

On Azure OpenAI, GPT-3.5 Turbo 16k costs $0.5 per 1M input tokens and $2.00 per 1M output tokens.

What is the context window for GPT-3.5 Turbo 16k on Azure OpenAI?

GPT-3.5 Turbo 16k supports a 16,000 token context window on Azure OpenAI.

How does Azure OpenAI compare to other GPT-3.5 Turbo 16k providers?

GPT-3.5 Turbo 16k is available from 3 providers. The cheapest input pricing is $0.5/1M tokens from Azure OpenAI.

Who created GPT-3.5 Turbo 16k?

GPT-3.5 Turbo 16k was created by OpenAI as part of the GPT-3.5 model family.

Is GPT-3.5 Turbo 16k open source?

GPT-3.5 Turbo 16k's open source status is unknown in the seed data.

Get Started

Model Specs

Released2023-06-13
Parameters20B
Context16K
ArchitectureDecoder Only
Knowledge cutoff2021-09

Related Models on Azure OpenAI