LLM Reference

Gopher 280B

Released: 2022-03-29
Last refreshed: 2026-04-15
Status: Researched 154d ago

Gopher 280B has model metadata on file, but no provider token pricing is tracked for it, so it is not a default production pick.

Use it for

  • Teams evaluating general LLM work

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Released: 2022-03-29
Parameters: 280B
Architecture: Decoder Only
Specialization: general
Training: finetuned
Created by

DeepMind

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2010

Pricing

No tracked provider token pricing is available yet.

About

Gopher 280B is a 280-billion-parameter language model developed by DeepMind, larger than OpenAI's 175-billion-parameter GPT-3. It uses a Transformer architecture with two modifications: RMSNorm in place of LayerNorm, and relative positional encoding, which helps the model handle longer text sequences. Its training corpus is a 10.5 TB text dataset. Gopher performs well on NLP tasks such as reading comprehension and toxic language detection but is weaker on logical reasoning tasks. It also tends to repeat itself and to reflect biases in its training data, which motivates further work on training techniques that improve accuracy and mitigate bias.
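
For readers unfamiliar with RMSNorm, the normalization mentioned above, here is a minimal pure-Python sketch of the general technique. It is an illustration only, not DeepMind's implementation: the function name rms_norm, the gain parameter, and the eps value are assumptions chosen for this example.

```python
import math


def rms_norm(x, gain=None, eps=1e-6):
    """Scale a vector by the reciprocal of its root mean square.

    Unlike LayerNorm, no mean is subtracted and no bias is added;
    only a per-element gain is applied. `gain` and `eps` are
    illustrative defaults, not values taken from Gopher.
    """
    if gain is None:
        gain = [1.0] * len(x)
    # Mean of squared activations across the feature dimension.
    mean_sq = sum(v * v for v in x) / len(x)
    # eps keeps the division stable when the input is near zero.
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale * g for v, g in zip(x, gain)]


if __name__ == "__main__":
    print(rms_norm([3.0, 4.0]))
```

With unit gain, the output has a root mean square of approximately 1 regardless of how the input is scaled, which is the property the normalization is meant to provide.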

Gopher 280B is a model in the Chinchilla family. No headline benchmark score is tracked for Gopher 280B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

No model capability flags are currently sourced.

Benchmark peer bars for Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks (4)