GPT-1

Name: GPT-1
Author: OpenAI

Released

2018-06-11

Last refreshed

2026-04-15

Status

Researched 182d ago

ProprietaryCommercial use: conditional

GPT-1 is an early limited-data entry; LLMReference does not yet track enough provider, pricing, benchmark, or task-fit evidence to recommend it.

Use it for

Teams evaluating general LLM work
Workloads that can use a 512 context window

Do not use it for

Cost-sensitive launches that need sourced token pricing
Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: GPT-1
Released: 2018-06-11
Context: 512
Parameters: 120M
Architecture: Decoder Only
Specialization: general
Openness: Proprietary
License: ProprietaryCommercial use: conditional
Training: Fine-tuned

Created by

OpenAI

Cutting-edge research and development.

San Francisco, California, United States

Founded 2015

Website

Pricing

No tracked provider token pricing is available yet.

About

GPT-1, released by OpenAI in 2018, was a groundbreaking large language model that introduced a 12-layer decoder-only transformer architecture. It featured 12 masked self-attention heads with 64-dimensional states, resulting in a total of 768 dimensions 1310. Despite its relatively modest parameter size of 117 million, GPT-1 effectively demonstrated the potential for generating human-like text, answering questions, and completing sentences 56. However, it faced limitations such as a limited context window and the requirement for significant labeled data for fine-tuning 4. Nevertheless, GPT-1 laid a crucial foundation for future models, paving the way for the development of more advanced iterations like GPT-2 and GPT-3, which expanded and improved upon its capabilities 2.

GPT-1 is a proprietary model. The structured metadata tracks a 512-token context window. No headline benchmark score is tracked for GPT-1 yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Show all 43 popular comparisonssorted by 7-day search impressions

GPT-1 vs Seed-OSS 36B Instruct8 GPT-1 vs GPT-4 Turbo Preview7 GPT-1 vs Granite Guardian 3.0 8B7 GPT-1 vs Magistral Small 25066 GPT-1 vs Mistral 7B v0.15 GPT-1 vs Nemotron Mini Hindi 4B Instruct5 GPT-1 vs Nemotron 4 340B5 GPT-1 vs Claude Instant 1.15 GPT-1 vs Llama 3.1 NemoGuard 8B Topic Control5 GPT-1 vs Doubao Pro 256K5 GPT-1 vs Llama 2 7B Chat4 GPT-1 vs Xiaomi MiMo-V2.54 GPT-1 vs Together AI Qwen2-72B-Instruct4 GPT-1 vs Claude Instant 1.24 GPT-1 vs Sarvam-M Multilingual Hybrid4 GPT-1 vs Mistral Large 24 GPT-1 vs Swallow 30B4 GPT-1 vs Llama 3.2 NV EmbedQA 1B v23 GPT-1 vs Italia 10B Instruct3 GPT-1 vs Qwen3.5-9B3 GPT-1 vs Llama 3.1 Swallow 8B Instruct3 GPT-1 vs Dracarys Llama 3.1 70B Instruct2 GPT-1 vs MiniCPM 2B2 GPT-1 vs Mistral Medium 3 Instruct2 GPT-1 vs Mistral Small 32 GPT-1 vs Llama 3 Swallow 70B Instruct2 GPT-1 vs Mistral Medium 3.52 GPT-1 vs Qwen2-7B-Instruct2 GPT-1 vs Llama 2 7B1 GPT-1 vs Codex Mini Latest1 GPT-1 vs Claude Instant1 GPT-1 vs Together AI - Gemma 3n-e4B1 GPT-1 vs Bielik 11B v2.6 Instruct1 GPT-1 vs Llama 3.1 Nemotron Nano 4B v1.11 GPT-1 vs Phi-4 Mini Flash Reasoning1 GPT-1 vs Falcon 3 7B Instruct1 GPT-1 vs ShieldGemma 9B1 GPT-1 vs Teuken 7B Instruct1 GPT-1 vs Aquila 2 34B1 GPT-1 vs Llama 3.1 Nemotron 70B Reward1 GPT-1 vs Llama Guard 2 8B1 GPT-1 vs Mistral Nemotron1 GPT-1 vs Italia 70B Instruct1

Frequently asked questions

What is the context window of GPT-1?

GPT-1 has a context window of 512 tokens.

When was GPT-1 released?

GPT-1 was released on 2018-06-11.