LLM Reference

GPT-1

Released: 2018-06-11
Last refreshed: 2026-04-15
Status: Researched 182d ago
Proprietary. Commercial use: conditional

GPT-1 is an early limited-data entry; LLMReference does not yet track enough provider, pricing, benchmark, or task-fit evidence to recommend it.

Use it for

  • Teams evaluating general LLM work
  • Workloads that fit within a 512-token context window

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Family: GPT-1
Released: 2018-06-11
Context: 512
Parameters: 120M
Architecture: Decoder Only
Specialization: general
Openness: Proprietary
License: Proprietary. Commercial use: conditional
Training: Fine-tuned

Created by

OpenAI. Cutting-edge research and development.

San Francisco, California, United States
Founded 2015

Pricing

No tracked provider token pricing is available yet.

About

GPT-1, released by OpenAI in 2018, introduced a 12-layer decoder-only transformer architecture. Each layer uses 12 masked self-attention heads with 64-dimensional states, for a total width of 768 dimensions (12 × 64). With a relatively modest 117 million parameters, GPT-1 demonstrated the potential of such models for generating human-like text, answering questions, and completing sentences. Its main limitations were a limited context window and the need for significant labeled data for fine-tuning. GPT-1 nevertheless laid the foundation for later models such as GPT-2 and GPT-3, which expanded and improved upon its capabilities.
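
The arithmetic behind these figures can be checked with a short sketch. It multiplies 12 heads by 64 dimensions to recover the 768-dimensional width, then estimates the parameter count of a 12-layer decoder with a 512-token context. Several values are assumptions not stated on this page: a vocabulary of 40,478 tokens, a feed-forward layer four times the model width, learned position embeddings, and an output layer that shares weights with the token embedding. The names `DecoderConfig` and `count_parameters` are hypothetical helpers for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DecoderConfig:
    """Shape of a GPT-1-style decoder (illustrative, not an official config)."""
    n_layers: int = 12        # from the description above
    n_heads: int = 12         # from the description above
    head_dim: int = 64        # from the description above
    context: int = 512        # tracked context window
    vocab_size: int = 40_478  # ASSUMPTION: not stated on this page
    ffn_mult: int = 4         # ASSUMPTION: feed-forward width = 4 * d_model

    @property
    def d_model(self) -> int:
        # Total width is the per-head size times the number of heads.
        return self.n_heads * self.head_dim


def count_parameters(cfg: DecoderConfig) -> int:
    """Estimate trainable parameters, assuming tied input/output embeddings."""
    d = cfg.d_model
    d_ff = cfg.ffn_mult * d
    # Token embeddings plus learned position embeddings.
    embeddings = cfg.vocab_size * d + cfg.context * d
    # Fused query/key/value projection plus the output projection (with biases).
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Two-layer feed-forward network (with biases).
    feed_forward = (d * d_ff + d_ff) + (d_ff * d + d)
    # Two layer norms per block, each with a scale and a shift vector.
    layer_norms = 2 * (2 * d)
    per_layer = attention + feed_forward + layer_norms
    return embeddings + cfg.n_layers * per_layer


if __name__ == "__main__":
    cfg = DecoderConfig()
    print(f"d_model = {cfg.d_model}")
    print(f"parameters = {count_parameters(cfg):,}")
```

Under these assumptions the estimate comes to about 116.5 million parameters, which is consistent with the 117 million quoted above and with the rounded 120M shown in the specifications.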

GPT-1 is a proprietary model. The structured metadata tracks a 512-token context window. No headline benchmark score is tracked for GPT-1 yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

No model capability flags are currently sourced.

Benchmark peer bars for Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of GPT-1?

GPT-1 has a context window of 512 tokens.
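
In practice, a 512-token limit means longer inputs have to be split before they reach the model. The sketch below shows one common approach, fixed-size windows with optional overlap, operating on a list of token ids that has already been produced by some tokenizer. The function name `split_into_windows` and the overlap parameter are hypothetical; nothing here is specific to how GPT-1 itself was served.

```python
from typing import List, Sequence


def split_into_windows(
    token_ids: Sequence[int], window: int = 512, overlap: int = 0
) -> List[List[int]]:
    """Split token ids into chunks of at most `window` tokens.

    Consecutive chunks share `overlap` tokens so that context at the
    boundaries is not lost entirely.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")

    stride = window - overlap
    windows: List[List[int]] = []
    start = 0
    while start < len(token_ids):
        windows.append(list(token_ids[start:start + window]))
        # Stop once a window reaches the end, so no redundant tail is emitted.
        if start + window >= len(token_ids):
            break
        start += stride
    return windows


if __name__ == "__main__":
    ids = list(range(1200))  # stand-in for real token ids
    print([len(w) for w in split_into_windows(ids)])
    print([len(w) for w in split_into_windows(ids, overlap=64)])
```

With no overlap, 1,200 tokens yield windows of 512, 512, and 176 tokens; an overlap trades extra model calls for some shared context between neighbouring windows.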

When was GPT-1 released?

GPT-1 was released on 2018-06-11.