llmreference
Together AI

Using DBRX Instruct on Together AI

Implementation guide · DBRX · Databricks Mosaic

Serverless

Quick Start

  1. 1
    Create an account at Together AI and generate an API key.
  2. 2
    Use the Together AI SDK or REST API to call dbrx-instruct — see the documentation for request format.
  3. 3
    You'll be billed $1.20/1M input, $1.20/1M output tokens. See full pricing.

Code Examples

Install
pip install together
API key
TOGETHER_API_KEY
Model ID
dbrx-instruct

Together uses "organization/model-name" format, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". See the Together model catalog for the exact ID.

from together import Together

client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="dbrx-instruct",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

About Together AI

Platform for running open-source and proprietary LLMs

Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Pricing on Together AI

TypePrice (per 1M)
Input tokens$1.20
Output tokens$1.20

Capabilities

Structured Outputs

About DBRX Instruct

DBRX Instruct, developed by Databricks, is a cutting-edge large language model designed for various natural language processing tasks. It excels in text summarization, question answering, information extraction, and code generation, utilizing a fine-grained mixture-of-experts architecture with 132 billion parameters. With advanced features like rotary position encodings, gated linear units, and grouped query attention, it performs exceptionally across multiple benchmarks, even outperforming some closed-source models. Trained on a vast 12 trillion token dataset, it supports contexts up to 32,000 tokens. Although primarily effective in English, its multilingual strength isn't fully explored. Users should be cautious as it may generate inaccurate or biased outputs.

Model Specs

Released2024-03-27
Parameters132B
Context32K
ArchitectureMixture of Experts

Provider

Together AI
Together AI

San Francisco, California, United States