LLM Reference

Chinchilla Models by Google DeepMind

This model family is considered obsolete. Consider newer alternatives in Related Model Families below.
2 models · 2022

About

The Chinchilla family of large language models, developed by Google DeepMind, was introduced in March 2022. The models are notable for their study of scaling laws in LLM training: they showed that, for a fixed compute budget, model size and the number of training tokens should be scaled in equal proportion. For instance, the 70-billion-parameter Chinchilla model used the same compute budget as the 280-billion-parameter Gopher model but was trained on roughly four times as much data, and it outperformed Gopher across numerous benchmarks. This result challenged the earlier assumption that increasing model size alone is the best route to better performance, and emphasized the role of sufficient training data in achieving state-of-the-art results [1][2][3].
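The compute comparison above can be checked with a back-of-the-envelope calculation. The sketch below is illustrative only and rests on assumptions that do not appear on this page: the widely used approximation that training costs about 6 FLOPs per parameter per token, and training-token counts of roughly 300 billion for Gopher and 1.4 trillion for Chinchilla as reported in the published papers. The function and dictionary names are hypothetical.

```python
# Rough training-compute estimate, C ~= 6 * N * D, where N is the parameter
# count and D is the number of training tokens.
# ASSUMPTIONS (not stated on this page): the 6*N*D rule of thumb, and the
# token counts below, which come from the published Gopher/Chinchilla papers.

def training_flops(params: float, tokens: float) -> float:
    """Approximate training FLOPs at about 6 FLOPs per parameter per token."""
    return 6.0 * params * tokens


# Hypothetical lookup table used only for this illustration.
MODELS = {
    "Gopher 280B": {"params": 280e9, "tokens": 300e9},
    "Chinchilla 70B": {"params": 70e9, "tokens": 1.4e12},
}


def summarize(models: dict) -> dict:
    """Return estimated FLOPs and tokens-per-parameter for each model."""
    rows = {}
    for name, m in models.items():
        rows[name] = {
            "flops": training_flops(m["params"], m["tokens"]),
            "tokens_per_param": m["tokens"] / m["params"],
        }
    return rows


if __name__ == "__main__":
    for name, row in summarize(MODELS).items():
        print(
            f"{name}: ~{row['flops']:.2e} FLOPs, "
            f"{row['tokens_per_param']:.1f} tokens per parameter"
        )
```

Under these assumptions the two estimates land within about 20% of each other (on the order of 5e23 FLOPs), while Chinchilla sees about 20 tokens per parameter against roughly 1 for Gopher. That is the trade the paragraph describes: a similar budget spent on a smaller model and more data.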

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.


Gopher 280B
Use when the workload needs 280B parameters.
2022-03 · 280B parameters

Chinchilla 70B
Use when the workload needs 70B parameters.
2022-03 · 70B parameters

Release Timeline

1 release group

2022-03 (2 current)
Chinchilla 70B · 70B parameters · Current
Gopher 280B · 280B parameters · Current

Specifications (2 models)

Chinchilla model specifications comparison

Model           Released  Parameters
Gopher 280B     2022-03   280B
Chinchilla 70B  2022-03   70B

Frequently Asked Questions

What is Chinchilla used for?
Chinchilla is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Chinchilla compare to Gemma 4?
Chinchilla by Google DeepMind is strongest where you need coding, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. Chinchilla has 2 listed variants, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which Chinchilla model should I use?
If price is the main constraint, use the pricing table first because Chinchilla does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Gopher 280B.

Models (2)