LLM ReferenceLLM Reference

GPT-1

gpt

Researched 137d ago

Last refreshed 2026-04-15. Next refresh: weekly.

GPT-1 has model metadata, but missing tracked provider pricing keeps it from being a default production pick.

Decision context: Coding task fit, 0 tracked provider routes, and research from 2026-01-01.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 512 context window

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Cheapest output

-

No tracked output price

Provider routes

0

No provider route in seed

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-01-01

Researched 137d ago

stale

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

GPT-1, released by OpenAI in 2018, was a groundbreaking large language model that introduced a 12-layer decoder-only transformer architecture. It featured 12 masked self-attention heads with 64-dimensional states, resulting in a total of 768 dimensions 1310. Despite its relatively modest parameter size of 117 million, GPT-1 effectively demonstrated the potential for generating human-like text, answering questions, and completing sentences 56. However, it faced limitations such as a limited context window and the requirement for significant labeled data for fine-tuning 4. Nevertheless, GPT-1 laid a crucial foundation for future models, paving the way for the development of more advanced iterations like GPT-2 and GPT-3, which expanded and improved upon its capabilities 2.

GPT-1 has a 512-token context window.

Capabilities

No model capability flags are currently sourced.

Rankings

Specifications

FamilyGPT-1
Released2018-06-11
Parameters120M
Context512
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuned

Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website