LLM Reference

GPT-3.5 Turbo

Released
2023-03-01
Last refreshed
2026-05-22
Status
Researched 25d ago
DeprecatedProprietaryCodingClassificationJSON / Tool use

GPT-3.5 Turbo is a legacy integration reference; evaluate GPT-4.1 Mini before starting new work.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 16k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • New production launches
  • Vision or document-understanding workloads
Specifications
Family
GPT-3.5
Released
2023-03-01
Context
16k
Parameters
20B
Architecture
Decoder Only
Knowledge cutoff
2021-09
Specialization
general
License
Proprietary
Training
finetuned
Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website
Pricing
Output / 1M
$1.50
Input / 1M
$0.500

Cheapest of 6 routes · Azure OpenAI

This model is deprecated. OpenAI recommends switching to GPT-4.1 Mini.

About

GPT-3.5 Turbo is an advanced language model developed by OpenAI, showcasing significant advancements over GPT-3 and GPT-3.5. As the engine behind the popular ChatGPT application, it excels in tasks like text generation, translation, question answering, summarization, and code generation. This model employs Reinforcement Learning from Human Feedback (RLHF) to enhance accuracy and produce policy-optimized responses. Despite its prowess, it has a knowledge cutoff of September 2021 and can demonstrate biases from its training data. Occasionally, it may generate incorrect or nonsensical content, known as "hallucination," and is sensitive to input phrasing variations. Additionally, the free version may experience slowdowns due to high demand. Nevertheless, GPT-3.5 Turbo remains a powerful tool with versatile applications across numerous fields.

GPT-3.5 Turbo is a proprietary model in the GPT-3.5 family. The structured metadata tracks a 16k-token context window and structured outputs. This page tracks provider routes through Azure OpenAI, OpenAI API, Salesforce Einstein Generative AI, and 3 more, with the cheapest tracked route listed at $0.5 input and $1.5 output per 1M tokens. Headline tracked benchmarks include HumanEval 67.0, Massive Multitask Language Understanding 70.0, and GAOKAO 53.2.

Top use-case fit: coding, agents, and build tasks

Coding

1 relevant benchmark in the decision map.

Classification

1 relevant benchmark in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 6

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MBatch in / outRoute
Azure OpenAI$0.500$1.50-
Serverless
OpenAI API$0.500$1.50$0.250 / $0.750
Serverless
OpenRouter$0.500$1.50-
Serverless
Replicate API$0.500$1.50-
Serverless

Capabilities

Structured Outputs

Benchmark peer barsfor Coding

Benchmark scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
HumanEval67.0pass@1https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding70.05-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
GAOKAO53.2zero-shot, objective-accuracyhttps://github.com/OpenLMLab/GAOKAO-Bench

Migration checks

No linked migration route is available for this model yet.

API versions

gpt-3.5-turbo-0301gpt-3.5-turbo-0613gpt-3.5-turbo-1106gpt-3.5-turbo-0125gpt-3.5-turbo

Rankings & picks(1)