LLM ReferenceLLM Reference
Baseten API

Mixtral 8x22B v0.1 on Baseten API

Mixtral · MistralAI

Serverless

Last refreshed 2026-04-19. Next refresh: weekly.

Why use Mixtral 8x22B v0.1 on Baseten API?

Baseten API offers Mixtral 8x22B v0.1 with competitive pricing. Baseten is an AI infrastructure platform that provides comprehensive tools for deploying and serving machine learning models efficiently and cost-effectively.

Compare Mixtral 8x22B v0.1 across 8 providers to find the best fit for your use case
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: mixtral-8x22b-v0.1
Model ID
mixtral-8x22b-v0.1

Request example

Curated snippets for this provider are not sourced yet. Use Baseten API documentation with model ID mixtral-8x22b-v0.1.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare Mixtral 8x22B v0.1 Across Providers

ProviderInput (per 1M)Output (per 1M)
NVIDIA NIM
OctoAI API (Deprecated)$1.20$1.20
Fireworks AI$1.20$1.20
DeepInfra$0.65$0.65
Baseten API
View all 8 providers →

Capabilities

No model capability flags are currently sourced.

About Mixtral 8x22B v0.1

The Mixtral 8x22B v0.1 is a pretrained generative Sparse Mixture of Experts (MoE) model created by Mistral AI [1][2][4]. It utilizes a specialized architecture where different sub-models, termed "experts," manage distinct input segments, enhancing both efficiency and performance relative to traditional large language models [2][10][12]. This model features an impressive 176 billion parameters and supports a context length of 65,000 tokens [10][13]. It excels in text generation, completion, and question answering, outperforming models like LLaMA 2 70B on various benchmarks [4][5][7]. Nonetheless, as a base model, it lacks inherent moderation capabilities, potentially generating inappropriate or harmful content without filtration [2][4][10]. The model requires significant VRAM—approximately 260GB in FP16 mode and 73GB in INT4 mode—for optimal operation [10][13] and may struggle with complex contextual understanding and current knowledge. Enhanced instruct-tuned versions, such as the Mixtral-8x22B-Instruct-v0.1, address some limitations by improving instruction adherence [3][5][6].

FAQ

What is the context window for Mixtral 8x22B v0.1 on Baseten API?

Mixtral 8x22B v0.1 supports a 64,000 token context window on Baseten API.

How does Baseten API compare to other Mixtral 8x22B v0.1 providers?

Mixtral 8x22B v0.1 is available from 8 providers. The cheapest input pricing is $0.65/1M tokens from DeepInfra.

Who created Mixtral 8x22B v0.1?

Mixtral 8x22B v0.1 was created by MistralAI as part of the Mixtral model family.

Is Mixtral 8x22B v0.1 open source?

Mixtral 8x22B v0.1 is open source under Apache 2.0 according to the seed data.

Get Started

Model Specs

Released2024-04-17
Parameters8x22B
Context64K
ArchitectureMixture of Experts

GPU-Hour Providers(1)

Related Models on Baseten API