LLM Reference

GPT-OSS Models by OpenAI

4 models | 2025 | Up to 131K context | From $0.03/1M input tokens

About

GPT-OSS is a family of 4 AI models by OpenAI, released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.


OpenAI GPT OSS Safeguard 120B
Use when the workload needs structured outputs.
2025-09 | structured outputs

gpt-oss-120b
Use when the workload needs 131K context, 120B parameters, and tool use.
2025-08 | 131K context | 120B parameters | tool use

gpt-oss-20b
Use when the workload needs 131K context, 20B parameters, and tool use.
2025-08 | 131K context | 20B parameters | tool use

GPT OSS Safeguard 20B
Use when the workload needs safety, 131K context, and 20B parameters.
2025-08 | safety | 131K context | 20B parameters

Release Timeline

2 release groups

2025-09 (1 current)
OpenAI GPT OSS Safeguard 120B | structured outputs | Current

2025-08 (3 current)
GPT OSS Safeguard 20B | safety | 131K context | 20B parameters | Current
gpt-oss-120b | 131K context | 120B parameters | tool use | Current
gpt-oss-20b | 131K context | 20B parameters | tool use | Current

Specifications (4 models)

GPT-OSS model specifications comparison
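| Model | Released | Context | Parameters | Fn Calling | Tool Use | Structured Outputs |
|---|---|---|---|---|---|---|
| OpenAI GPT OSS Safeguard 120B | 2025-09 | - | - | No | No | Yes |
| gpt-oss-120b | 2025-08 | 131K | 120B | Yes | Yes | Yes |
| gpt-oss-20b | 2025-08 | 131K | 20B | Yes | Yes | Yes |
| GPT OSS Safeguard 20B | 2025-08 | 131K | 20B | Yes | Yes | Yes |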
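
The specifications table can be turned into a small lookup for shortlisting models by capability. The following is a minimal sketch, not part of the listing itself: the `ModelSpec` class, the `candidates` function, and their parameter names are hypothetical, and the values are copied from the table above (a context of `None` means the table lists no value).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelSpec:
    """One row of the specifications table (hypothetical helper type)."""
    name: str
    released: str
    context_k: Optional[int]  # context window in thousands of tokens; None if not listed
    fn_calling: bool
    tool_use: bool
    structured_outputs: bool


# Values copied from the specifications table.
SPECS = [
    ModelSpec("OpenAI GPT OSS Safeguard 120B", "2025-09", None, False, False, True),
    ModelSpec("gpt-oss-120b", "2025-08", 131, True, True, True),
    ModelSpec("gpt-oss-20b", "2025-08", 131, True, True, True),
    ModelSpec("GPT OSS Safeguard 20B", "2025-08", 131, True, True, True),
]


def candidates(need_tool_use: bool = False, min_context_k: int = 0) -> List[str]:
    """Return the names of models that meet the stated requirements.

    A model with no listed context is treated as unknown and is excluded
    whenever a minimum context is requested.
    """
    result = []
    for spec in SPECS:
        if need_tool_use and not spec.tool_use:
            continue
        if min_context_k and (spec.context_k is None or spec.context_k < min_context_k):
            continue
        result.append(spec.name)
    return result


if __name__ == "__main__":
    print(candidates(need_tool_use=True, min_context_k=100))
```

Treating an unlisted context as "unknown" rather than zero is a design choice of this sketch; it keeps a model with missing data from being silently accepted or ranked.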

Available From (8 providers)

Pricing

GPT-OSS model pricing by provider
| Model | Provider | Input / 1M tokens | Output / 1M tokens | Type |
|---|---|---|---|---|
| gpt-oss-20b | OpenRouter | $0.03 | $0.14 | Serverless |
| gpt-oss-120b | OpenRouter | $0.039 | $0.18 | Serverless |
| gpt-oss-20b | Fireworks AI | $0.07 | $0.3 | Serverless |
| gpt-oss-20b | GCP Vertex AI | $0.07 | $0.25 | Serverless |
| GPT OSS Safeguard 20B | AWS Bedrock | $0.07 | $0.2 | Serverless |
| gpt-oss-20b | GroqCloud | $0.075 | $0.3 | Serverless |
| GPT OSS Safeguard 20B | GroqCloud | $0.075 | $0.3 | Serverless |
| GPT OSS Safeguard 20B | OpenRouter | $0.075 | $0.3 | Serverless |
| gpt-oss-120b | GCP Vertex AI | $0.09 | $0.36 | Serverless |
| gpt-oss-20b | Replicate API | $0.09 | $0.36 | Serverless |
| gpt-oss-120b | Together AI | $0.15 | $0.6 | Serverless |
| gpt-oss-120b | Fireworks AI | $0.15 | $0.6 | Serverless |
| gpt-oss-120b | GroqCloud | $0.15 | $0.6 | Serverless |
| OpenAI GPT OSS Safeguard 120B | AWS Bedrock | $0.15 | $0.6 | Serverless |
| gpt-oss-120b | Replicate API | $0.18 | $0.72 | Serverless |
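
Because prices are quoted per 1M tokens, the cost of a workload is the input token count times the input price plus the output token count times the output price, divided by one million. The sketch below illustrates that arithmetic with three rows copied from the pricing table; the `PRICES` mapping and the `estimate_cost` function are hypothetical names, and real bills may differ (for example through caching discounts or minimum charges, which this listing does not describe).

```python
# (input $ per 1M tokens, output $ per 1M tokens), copied from the pricing table.
PRICES = {
    ("gpt-oss-20b", "OpenRouter"): (0.03, 0.14),
    ("gpt-oss-120b", "OpenRouter"): (0.039, 0.18),
    ("gpt-oss-120b", "GroqCloud"): (0.15, 0.60),
}


def estimate_cost(model: str, provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of a workload from listed per-1M-token prices."""
    in_price, out_price = PRICES[(model, provider)]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # 2M input tokens and 500K output tokens on gpt-oss-20b via OpenRouter:
    # 2 * 0.03 + 0.5 * 0.14 = 0.06 + 0.07 = 0.13
    cost = estimate_cost("gpt-oss-20b", "OpenRouter", 2_000_000, 500_000)
    print(f"${cost:.2f}")
```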

Frequently Asked Questions

What is GPT-OSS used for?
GPT-OSS is used for safety, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does GPT-OSS compare to GPT Realtime 2?
GPT-OSS by OpenAI is strongest where you need safety, while GPT Realtime 2, also by OpenAI, is the closest related family to consider for translation. GPT-OSS has 4 listed variants, and both families reach up to 131K context, so compare the specs and pricing tables before choosing a production model.
Which GPT-OSS model should I use?
For the lowest listed input price, start with gpt-oss-20b through OpenRouter at $0.03/1M input tokens. For the most capable local choice, evaluate gpt-oss-120b, which lists 131K context, tool use, function calling, and structured outputs.
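
The "lowest listed input price" recommendation can be reproduced by scanning the pricing rows for a given model. This is a minimal sketch under stated assumptions: the `LISTED` rows are copied from the pricing table for the two general-purpose models only, and `cheapest_by_input` is a hypothetical helper that breaks ties on output price.

```python
# (model, provider, input $ per 1M tokens, output $ per 1M tokens), from the pricing table.
LISTED = [
    ("gpt-oss-20b", "OpenRouter", 0.03, 0.14),
    ("gpt-oss-20b", "Fireworks AI", 0.07, 0.30),
    ("gpt-oss-20b", "GCP Vertex AI", 0.07, 0.25),
    ("gpt-oss-20b", "GroqCloud", 0.075, 0.30),
    ("gpt-oss-20b", "Replicate API", 0.09, 0.36),
    ("gpt-oss-120b", "OpenRouter", 0.039, 0.18),
    ("gpt-oss-120b", "GCP Vertex AI", 0.09, 0.36),
    ("gpt-oss-120b", "Together AI", 0.15, 0.60),
    ("gpt-oss-120b", "Fireworks AI", 0.15, 0.60),
    ("gpt-oss-120b", "GroqCloud", 0.15, 0.60),
    ("gpt-oss-120b", "Replicate API", 0.18, 0.72),
]


def cheapest_by_input(model: str):
    """Return the listed row with the lowest input price for `model`.

    Ties on input price are broken by the lower output price.
    Raises ValueError if the model has no listed rows.
    """
    rows = [row for row in LISTED if row[0] == model]
    return min(rows, key=lambda row: (row[2], row[3]))


if __name__ == "__main__":
    for name in ("gpt-oss-20b", "gpt-oss-120b"):
        _, provider, in_price, out_price = cheapest_by_input(name)
        print(f"{name}: {provider} at ${in_price}/1M input, ${out_price}/1M output")
```

Input price alone is rarely the whole picture; for output-heavy workloads, sorting on a blended cost for the expected token mix may give a different winner.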

Models (4)