Mixtral 8x22B v0.1 on Together AI

Name: Mixtral 8x22B v0.1 on Together AI
Brand: MistralAI
SKU: mixtral-8x22b-v0.1-together-ai
Price: 1.2 USD

Mixtral · MistralAI

ServerlessOpen Source

Last refreshed 2026-06-15. Next refresh: weekly.

Why use Mixtral 8x22B v0.1 on Together AI?

Together AI offers Mixtral 8x22B v0.1 with pay-as-you-go pricing at $1.20/1M input tokens. Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Compare Mixtral 8x22B v0.1 across 8 providers to find the best fit for your use case

Input / 1M

$1.20

Output / 1M

$1.20

Cache

Not sourced

Batch

Not sourced

Setup recipe

Python + curl

Install

pip install together

Auth

export TOGETHER_API_KEY=...

Call

from together import Together
client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="mixtral-8x22b-v0.1",

Model ID

mixtral-8x22b-v0.1

Request example

from together import Together

client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="mixtral-8x22b-v0.1",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

Together uses "organization/model-name" format, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". See the Together model catalog for the exact ID.
The examples expect TOGETHER_API_KEY; rename it only if your application config maps the new variable.

Compare Mixtral 8x22B v0.1 Across Providers

Provider	Input (per 1M)	Output (per 1M)
NVIDIA NIM	—	—
OctoAI API (Deprecated)	$1.20	$1.20
Fireworks AI	$1.20	$1.20
DeepInfra	$0.65	$0.65
Baseten API	—	—

View all 8 providers →

Pricing

Type	Price (per 1M)
Input tokens	$1.20
Output tokens	$1.20

Capabilities

No model capability flags are currently sourced.

About Mixtral 8x22B v0.1

The Mixtral 8x22B v0.1 is a pretrained generative Sparse Mixture of Experts (MoE) model created by Mistral AI [1][2][4]. It utilizes a specialized architecture where different sub-models, termed "experts," manage distinct input segments, enhancing both efficiency and performance relative to traditional large language models [2][10][12]. This model features an impressive 176 billion parameters and supports a context length of 65,000 tokens [10][13]. It excels in text generation, completion, and question answering, outperforming models like LLaMA 2 70B on various benchmarks [4][5][7]. Nonetheless, as a base model, it lacks inherent moderation capabilities, potentially generating inappropriate or harmful content without filtration [2][4][10]. The model requires significant VRAM—approximately 260GB in FP16 mode and 73GB in INT4 mode—for optimal operation [10][13] and may struggle with complex contextual understanding and current knowledge. Enhanced instruct-tuned versions, such as the Mixtral-8x22B-Instruct-v0.1, address some limitations by improving instruction adherence [3][5][6].

FAQ

What does Mixtral 8x22B v0.1 cost on Together AI?

On Together AI, Mixtral 8x22B v0.1 costs $1.2 per 1M input tokens and $1.2 per 1M output tokens.

What is the context window for Mixtral 8x22B v0.1 on Together AI?

Mixtral 8x22B v0.1 supports a 64k token context window on Together AI.

How does Together AI compare to other Mixtral 8x22B v0.1 providers?

Mixtral 8x22B v0.1 is available from 8 providers. The cheapest input pricing is $0.65/1M tokens from DeepInfra.

Who created Mixtral 8x22B v0.1?

Mixtral 8x22B v0.1 was created by MistralAI as part of the Mixtral model family.

Is Mixtral 8x22B v0.1 open source?

Mixtral 8x22B v0.1 is open source under Apache 2.0 according to the seed data.

Get Started

Docs Portal Playground Pricing

Model Specs

Released2024-04-17

Parameters8x22B

Context64k

ArchitectureMixture of Experts

Knowledge cutoff2024-01