LLM ReferenceLLM Reference

Cerebras GPT

7 models2023

About

The Cerebras GPT family includes seven open-source large language models, ranging from 111 million to 13 billion parameters. These models were developed by Cerebras Systems using the Chinchilla formula, optimizing 20 tokens per parameter to achieve high accuracy within a defined compute budget. Available on Hugging Face under the Apache 2.0 license, these models are accessible for both research and commercial use. Training took place on the Andromeda AI supercomputer, leveraging Cerebras' weight streaming technology for efficient computation across multiple nodes. This setup enhances training speed, reduces costs, and minimizes energy consumption, making them notably efficient compared to other models available 12.

Specifications(7 models)

Cerebras GPT model specifications comparison
ModelReleasedParametersReasoningCode Exec
Cerebras GPT 13B2023-0313BNoNo
Cerebras GPT 7B2023-037BNoNo
Cerebras GPT 2.7B2023-032.7BNoNo
Cerebras GPT 1.3B2023-031.3BNoNo
Cerebras GPT 590M2023-03YesYes
Cerebras GPT 256M2023-03NoNo
Cerebras GPT 111M2023-03NoNo

Frequently Asked Questions

What is Cerebras GPT?
The Cerebras GPT family includes seven open-source large language models, ranging from 111 million to 13 billion parameters. These models were developed by Cerebras Systems using the Chinchilla formula, optimizing 20 tokens per parameter to achieve high accuracy within a defined compute budget. Available on Hugging Face under the Apache 2.0 license, these models are accessible for both research and commercial use. Training took place on the Andromeda AI supercomputer, leveraging Cerebras' weight streaming technology for efficient computation across multiple nodes. This setup enhances training speed, reduces costs, and minimizes energy consumption, making them notably efficient compared to other models available 12.
How many models are in the Cerebras GPT family?
The Cerebras GPT family contains 7 models.
What is the latest Cerebras GPT model?
The latest model is Cerebras GPT 13B, released in 2023-03.

Models(7)