LLM Reference

DBRX Models by Databricks Mosaic

Databricks MosaicDBRXOpen weightsHighlightOpen Source
7 models2024Up to 33k ctxFrom $0.6/1M input

Details

LicenseDBRX
Commercial useCommercial use with conditions
Models7
Released2024
Max context33k

Capabilities

Structured Outputs2 of 7 models

About

The DBRX family of large language models (LLMs), developed by Databricks, includes DBRX Base and DBRX Instruct, both of which have been released under an open license. These transformer-based, decoder-only models employ a distinguished fine-grained mixture-of-experts (MoE) architecture. This approach leverages a larger number of smaller expert networks, enhancing the quality of output compared to models with fewer, larger experts. Trained on an extensive dataset of 12 trillion tokens encompassing text and code, these models showcase high performance with the capability to handle up to 32,000 tokens of context. DBRX Instruct is specially optimized for few-turn interactions, making it suitable for efficient conversational applications. The DBRX models not only excel in efficiency during training and inference but also outperform many leading closed and open-source models in various benchmarks. They are accessible for download on Hugging Face and integrated through Databricks' Foundation Model APIs.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

6 in view1 retired

Use when the workload needs 32k context and 229B parameters.

2024-1132k context229B parameters

Use when the workload needs 32k context and 20B parameters.

2024-0732k context20B parameters
DBRXCurrent

Use when the workload needs 32k context and 132B parameters.

2024-0332k context132B parameters

Use when the workload needs 33k context, 132B parameters, and structured outputs.

2024-0333k context132B parametersstructured outputs

Use when the workload needs 33k context and 132B parameters.

2024-0333k context132B parameters
DBRX BaseCurrent

Use when the workload needs 32k context and 132B parameters.

2024-0332k context132B parameters

Release Timeline

3 release groups
2024-11
1 current
DBRX Expanse
32k context229B parameters
Current
2024-07
1 current
DBRX Instruct 20B
32k context20B parameters
Current
2024-03
4 current · 1 retired
DBRX
32k context132B parameters
Current
DBRX Base
32k context132B parameters
Current
DBRX Instruct
32k context132B parametersstructured outputs
Archived
DeepInfra DBRX Instruct
33k context132B parametersstructured outputs
Current
Fireworks DBRX-Instruct
33k context132B parameters
Current

Specifications(7 models)

DBRX model specifications comparison
ModelReleasedContextParametersStructured Outputs
DBRX Expanse2024-1132k229BNo
DBRX Instruct 20B2024-0732k20BNo
DBRX2024-0332k132BNo
DeepInfra DBRX Instruct2024-0333k132BYes
Fireworks DBRX-Instruct2024-0333k132BNo
DBRX Base2024-0332k132BNo

Pricing

DBRX model pricing by provider
ModelProviderInput / 1MOutput / 1MType
DeepInfra DBRX InstructDeepInfra$0.6$1.2Serverless
Fireworks DBRX-InstructFireworks AI$1.5$1.5Serverless

Frequently Asked Questions

What is DBRX used for?
DBRX is used for structured outputs and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does DBRX compare to MOSS-Audio?
DBRX by Databricks Mosaic is strongest where you need structured outputs, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. DBRX has 7 listed variants and reaches up to 33k context, so compare the specs and pricing tables before choosing a production model.
Which DBRX model should I use?
For the lowest listed input price, start with DBRX Instruct through DeepInfra at $0.6/1M input tokens. For the most capable/latest local choice, evaluate DeepInfra DBRX Instruct with 33k context and structured outputs.