Cerebras

9 models across 2 families · Latest: Cerebras LLaVA 13B (2024-08)

Researched 33d ago

World's largest AI chip innovation

CodingAgents

Cerebras's portfolio covers 9 active models across 2 current families, spanning coding and agents. Open a model detail page to compare provider routes and sourced benchmarks.
Covers 2 workload areas across 9 active tracked models; last verified 2026-06-15.

Use it for

Teams evaluating coding and agents across this lab's releases
Comparing model families before committing to a flagship
Migration and pricing follow-ups across 9 tracked models

Do not use it for

Choosing a hosting provider without opening a model page for price ladders

Active models

Current models from this lab, excluding deprecated ones

Active families

Current model families from this lab

Open catalog

9 open

9 open source / 0 open weights

Lowest output price

Not tracked

No provider output pricing linked yet

Latest dated release

2024-08-01

Cerebras LLaVA 13B

Freshness

2026-06-15

Researched 33d ago

aging

Information

Founded2016

Sunnyvale, California, United States

Links

Website GitHub X / Twitter LinkedIn HuggingFace Crunchbase

Release cadence

Showing 5 recent dated releases (full timeline below). Latest: Cerebras LLaVA 13B (2024-08-01).

Where this lab wins

Coding: 1 tracked model with SWE-bench / HumanEval-style scores.
Agentic: 1 tracked model with BFCL, tau-bench, and SWE-bench tool-use coverage.

Flagship quality / price signal

Flagship: Cerebras GPT 13B (best sourced coding quality-per-dollar in this portfolio).

Quality-per-dollar unavailable for this flagship — benchmark coverage or output token pricing is still missing.

Cerebras is an American AI research organization founded in 2016. World's largest AI chip innovation. Cerebras ships 2 model families totaling 9 models, with the most recent release Cerebras LLaVA 13B in 2024-08. Notable families include Cerebras LLaVA and Cerebras GPT. Use it as a stable reference for lab background, release coverage, and follow-up model pages as they are added. View official API endpoints, benchmark performance, and coding/agent fit for every Cerebras model.

About

Cerebras Systems is a forward-thinking leader in the AI domain, best known for its innovation in generative AI and the development of large language models (LLMs). Established in 2016 in Sunnyvale, California, the company was founded by a group of illustrious computer scientists and deep learning experts, who aimed to tackle one of the most challenging obstacles in the AI industry: the acceleration and efficiency of deep learning processes. Leveraging their past experiences, particularly from SeaMicro, the founding team brought groundbreaking ideas to the forefront of AI technology. At the heart of Cerebras' groundbreaking work is its pioneering wafer-scale engines (WSEs). These colossal chips transcend the capacity of traditional processors by incorporating compute, memory, and interconnect fabric on a single, expansive wafer. This distinct architecture is designed to significantly reduce data transfer and latency, offering unprecedented performance improvements in AI training and inference. The evolution of this technology from the WSE-2 to the WSE-3 signifies the company's relentless pursuit of excellence, culminating in systems such as the Cerebras CS-2 and CS-3, renowned for their remarkable speeds and capabilities. Cerebras' contributions to generative AI are substantial, highlighted by its collaboration with the Mayo Clinic and others in utilizing their powerful hardware to push the boundaries of AI applications in medical fields and other domains. The company's strategic release of seven open-source GPT-based large language models, trained on the highly advanced Andromeda AI supercluster, underscores its commitment and capacity to drive large-scale AI model training efficiently. This initiative not only illustrates Cerebras' proficiency in developing sophisticated AI models but also reflects its dedication to nurturing the AI community through open-source advancements. Navigating a competitive landscape dominated by tech giants like Nvidia, Cerebras has positioned itself as a formidable challenger. By capitalizing on the superior performance and cost-effectiveness of its wafer-scale technologies for large-scale AI applications, the company has secured a robust foothold within the AI chip market. Moreover, its notable revenue growth and strategic partnerships, especially with organizations such as G42, further extend its influence and credibility in the industry. Yet, like any ambitious venture, Cerebras is mindful of potential challenges, particularly its current financial reliance on key partners such as G42, as it looks towards its future growth and potential IPO efforts.

Featured models

Model	Released	Context	Input price ($/1M)	Output price ($/1M)	License	Openness
Cerebras LLaVA 13B	2024-08-01	4k	-	-	Apache 2.0	Open source
Cerebras LLaVA 7B	2024-08-01	4k	-	-	Apache 2.0	Open source
Cerebras GPT 13B	2023-03-13	2k	-	-	Apache 2.0	Open source

Model families

Cerebras LLaVA

Cerebras GPT

Recent releases

Cerebras LLaVA 13B- 2024-08-01
Cerebras LLaVA 7B- 2024-08-01
Cerebras GPT 13B- 2023-03-13
Cerebras GPT 7B- 2023-03-13
Cerebras GPT 2.7B- 2023-03-13

FAQ

Who founded Cerebras and when?

Cerebras was founded in 2016 and is associated with Sunnyvale, California, United States.

What models has Cerebras released?

Cerebras ships 9 models across 2 families: Cerebras LLaVA and Cerebras GPT.

Is Cerebras's technology open source?

All tracked models are released under Open Source.

Where is Cerebras headquartered?

Cerebras is headquartered in Sunnyvale, California, United States.

What is Cerebras known for?

World's largest AI chip innovation. Its most prominent tracked family is Cerebras LLaVA.

How can I access Cerebras's models?

Cerebras's provider availability is tracked on model pages as API and hosting data is verified.

Explore related pages

Cerebras LLaVA model family Cerebras GPT model family Cerebras LLaVA 13B model spec Cerebras LLaVA 7B model spec Cerebras GPT 13B model spec Microsoft AI Snowflake Reka Adept AI Midjourney

Last reviewed: 2026-06-15. Data sourced from public lab announcements and provider documentation.