LLM Reference
Cerebras

Cerebras

Researched 13d ago
Flagship Q/$
Quality
$/M out

9 models across 2 families · Latest: Cerebras LLaVA 13B (2024-08)

World's largest AI chip innovation

CodingAgents

Cerebras's portfolio covers 9 active models across 2 non-obsolete families, with task labels spanning coding and agents. Open a model detail page to compare provider routes and sourced benchmarks.

Portfolio context: 2 decision-task tags, 9 active tracked models, latest research stamp 2026-05-22.

Use this portfolio page for

  • Teams evaluating coding and agents across this lab's releases
  • Readers comparing families before locking a flagship SKU
  • 9 tracked SKUs for migration and pricing follow-ups

Do not stop here for

  • Choosing a hosting provider without opening a model page for price ladders

Active models

9

Non-deprecated SKUs linked to this researcher

Active families

2

Non-obsolete families in coverage

Open catalog

0 OSS

0 open-weight (text match)

Decision task tags

2

Mapped to the site-wide task taxonomy

Latest dated release

2024-08-01

Cerebras LLaVA 13B

Freshness

2026-05-22

Researched 13d ago

fresh

Release cadence

Showing 5 recent dated ships (full timeline below). Latest spotlight: Cerebras LLaVA 13B (2024-08-01).

Where this lab wins

  • Coding: 1 tracked model with SWE-bench / HumanEval-style scores.
  • Agentic: 1 tracked model with BFCL, tau-bench, and SWE-bench tool-use coverage.

Flagship quality / price signal

Anchor SKU: Cerebras GPT 13B (best sourced coding Q/$ in this portfolio).

Quality / dollar unavailable for this anchor — missing benchmark coverage and/or output token price on the cheapest ladder route (open the model detail after pricing lands).

Cerebras is an American AI research organization founded in 2016. World's largest AI chip innovation. Cerebras ships 2 model families totaling 9 models, with the most recent release Cerebras LLaVA 13B in 2024-08. Notable families include Cerebras LLaVA and Cerebras GPT. Use it as a stable reference for lab background, release coverage, and follow-up model pages as they are added. View official API endpoints, benchmark performance, and coding/agent fit for every Cerebras model.

About

Cerebras Systems is a forward-thinking leader in the AI domain, best known for its innovation in generative AI and the development of large language models (LLMs). Established in 2016 in Sunnyvale, California, the company was founded by a group of illustrious computer scientists and deep learning experts, who aimed to tackle one of the most challenging obstacles in the AI industry: the acceleration and efficiency of deep learning processes. Leveraging their past experiences, particularly from SeaMicro, the founding team brought groundbreaking ideas to the forefront of AI technology. At the heart of Cerebras' groundbreaking work is its pioneering wafer-scale engines (WSEs). These colossal chips transcend the capacity of traditional processors by incorporating compute, memory, and interconnect fabric on a single, expansive wafer. This distinct architecture is designed to significantly reduce data transfer and latency, offering unprecedented performance improvements in AI training and inference. The evolution of this technology from the WSE-2 to the WSE-3 signifies the company's relentless pursuit of excellence, culminating in systems such as the Cerebras CS-2 and CS-3, renowned for their remarkable speeds and capabilities. Cerebras' contributions to generative AI are substantial, highlighted by its collaboration with the Mayo Clinic and others in utilizing their powerful hardware to push the boundaries of AI applications in medical fields and other domains. The company's strategic release of seven open-source GPT-based large language models, trained on the highly advanced Andromeda AI supercluster, underscores its commitment and capacity to drive large-scale AI model training efficiently. This initiative not only illustrates Cerebras' proficiency in developing sophisticated AI models but also reflects its dedication to nurturing the AI community through open-source advancements. Navigating a competitive landscape dominated by tech giants like Nvidia, Cerebras has positioned itself as a formidable challenger. By capitalizing on the superior performance and cost-effectiveness of its wafer-scale technologies for large-scale AI applications, the company has secured a robust foothold within the AI chip market. Moreover, its notable revenue growth and strategic partnerships, especially with organizations such as G42, further extend its influence and credibility in the industry. Yet, like any ambitious venture, Cerebras is mindful of potential challenges, particularly its current financial reliance on key partners such as G42, as it looks towards its future growth and potential IPO efforts.

Featured models

ModelReleasedContextInput price ($/1M)Output price ($/1M)License
Cerebras LLaVA 13B2024-08-014k--Unknown
Cerebras LLaVA 7B2024-08-014k--Unknown
Cerebras GPT 13B2023-03-132k--Unknown

Model families

Recent releases

  1. Cerebras LLaVA 13B- 2024-08-01
  2. Cerebras LLaVA 7B- 2024-08-01
  3. Cerebras GPT 13B- 2023-03-13
  4. Cerebras GPT 7B- 2023-03-13
  5. Cerebras GPT 2.7B- 2023-03-13

FAQ

Who founded Cerebras and when?

Cerebras was founded in 2016 and is associated with Sunnyvale, California, United States.

What models has Cerebras released?

Cerebras ships 9 models across 2 families: Cerebras LLaVA and Cerebras GPT.

Is Cerebras's technology open source?

Cerebras's tracked models are primarily proprietary, with some license details still being verified.

Where is Cerebras headquartered?

Cerebras is headquartered in Sunnyvale, California, United States.

What is Cerebras known for?

World's largest AI chip innovation. Its most prominent tracked family is Cerebras LLaVA.

How can I access Cerebras's models?

Cerebras's provider availability is tracked on model pages as API and hosting data is verified.

Explore related pages

Last reviewed: 2026-05-22. Data sourced from public lab announcements and provider documentation.