LLM Reference

GPT-2 Models by OpenAI

This model family is considered obsolete. Consider newer alternatives in Related Model Families below.
4 models, 2019, up to 1k context

About

GPT-2 is a family of 4 AI models by OpenAI, released in 2019.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

GPT-2 XL (Current)

Use when the workload needs 1k context and 1.5B parameters.

2019-11, 1k context, 1.5B parameters

GPT-2 Large (Current)

Use when the workload needs 1k context and 774M parameters.

2019-08, 1k context, 774M parameters

GPT-2 Medium (Current)

Use when the workload needs 1k context and 355M parameters.

2019-02, 1k context, 355M parameters

GPT-2 (Current)

Use when the workload needs 1k context and 124M parameters.

2019-02, 1k context, 124M parameters
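
The use-when lines above follow a fixed pattern built from each variant's context and parameter fields. The sketch below shows one way such a sentence could be generated. It is an illustration only: the `Variant` record and the `use_when` function are hypothetical names, and the page's actual derivation (which also draws on seed capabilities, release, and replacement fields) is not shown in the source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    """Hypothetical record holding the fields listed for each variant."""
    name: str
    released: str    # YYYY-MM, as listed on this page
    context: str     # e.g. "1k"
    parameters: str  # e.g. "1.5B"


def use_when(variant: Variant) -> str:
    """Build the guidance sentence from the context and parameter fields.

    Assumption: only these two fields affect the wording, since they are
    the only ones that appear in the sentences on this page.
    """
    return (
        f"Use when the workload needs {variant.context} context "
        f"and {variant.parameters} parameters."
    )


if __name__ == "__main__":
    xl = Variant("GPT-2 XL", "2019-11", "1k", "1.5B")
    print(use_when(xl))
```

Because the sentence is a pure function of the record, all four variants would receive consistently worded guidance from the same template.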

Release Timeline

3 release groups
2019-11 (1 current)
GPT-2 XL: 1k context, 1.5B parameters (Current)

2019-08 (1 current)
GPT-2 Large: 1k context, 774M parameters (Current)

2019-02 (2 current)
GPT-2: 1k context, 124M parameters (Current)
GPT-2 Medium: 1k context, 355M parameters (Current)

Specifications (4 models)

GPT-2 model specifications comparison

Model         Released  Context  Parameters
GPT-2 XL      2019-11   1k       1.5B
GPT-2 Large   2019-08   1k       774M
GPT-2 Medium  2019-02   1k       355M
GPT-2         2019-02   1k       124M
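
The specifications above can also be held as plain records, which makes it easy to pick a variant programmatically. The following is a minimal sketch under stated assumptions: `Spec`, `SPECS`, and `largest_within` are hypothetical names, the parameter counts are the rounded figures from the table, and "1k" context is read as 1024 tokens.

```python
from typing import NamedTuple, Optional


class Spec(NamedTuple):
    """Hypothetical record mirroring one row of the specifications table."""
    model: str
    released: str        # YYYY-MM, as listed in the table
    context_tokens: int  # "1k" in the table, read here as 1024 tokens
    parameters: int      # rounded count from the table


SPECS = [
    Spec("GPT-2 XL", "2019-11", 1024, 1_500_000_000),
    Spec("GPT-2 Large", "2019-08", 1024, 774_000_000),
    Spec("GPT-2 Medium", "2019-02", 1024, 355_000_000),
    Spec("GPT-2", "2019-02", 1024, 124_000_000),
]


def largest_within(parameter_budget: int) -> Optional[Spec]:
    """Return the largest variant whose parameter count fits the budget.

    Returns None when even the smallest variant exceeds the budget.
    """
    candidates = [s for s in SPECS if s.parameters <= parameter_budget]
    return max(candidates, key=lambda s: s.parameters, default=None)


if __name__ == "__main__":
    choice = largest_within(500_000_000)
    print(choice.model if choice else "no variant fits")
```

Since all four variants share the same context length, parameter count is the only axis in this table that separates them, which is why the helper filters on it alone.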

Available From (1 provider)

Frequently Asked Questions

What is GPT-2 used for?
GPT-2 is a family of text-generation language models. This page lists its 4 variants so they can be compared by specs, pricing, provider access, and release data.

How does GPT-2 compare to GPT Realtime 2?
GPT-2 by OpenAI is strongest where you need its listed use cases, while GPT Realtime 2, also by OpenAI, is the closest related family to check for translation. GPT-2 has 4 listed variants and reaches up to 1k context; GPT Realtime 2 reaches up to 131k context. Compare the specs and pricing tables before choosing a production model.

Which GPT-2 model should I use?
Provider pricing for GPT-2 is incomplete in the local data, so if price is the main constraint, check the pricing table first and confirm what is actually listed. For the most capable and most recent variant in the local data, evaluate GPT-2 XL (1.5B parameters, 1k context).

Models (4)