What is the context window of Voxtral Mini Transcribe 2?

Voxtral Mini Transcribe 2 has a context window of 33k tokens.

When was Voxtral Mini Transcribe 2 released?

Voxtral Mini Transcribe 2 was released on 2026-02-04.

Which providers offer Voxtral Mini Transcribe 2?

Voxtral Mini Transcribe 2 is available from 1 provider: Mistral AI Studio.

What benchmarks has Voxtral Mini Transcribe 2 been tested on?

Voxtral Mini Transcribe 2 has been evaluated on 1 benchmark, including Artificial Analysis ASR WER.

Voxtral Mini Transcribe 2

Name: Voxtral Mini Transcribe 2
Author: MistralAI

Released

2026-02-04

Last refreshed

2026-06-15

Status

Researched 44d ago

ProprietaryCommercial use: conditionalMultimodalVision

Voxtral Mini Transcribe 2 is worth evaluating for vision when its provider route and context window match the workload.

Use it for

Teams evaluating vision
Workloads that can use a 33k context window
Buyers comparing 1 tracked provider route

Do not use it for

Strict JSON or tool-calling flows

Specifications

Family: Voxtral
Released: 2026-02-04
Context: 33k
Architecture: Decoder Only
Specialization: speech-to-text
Openness: Proprietary
License: ProprietaryCommercial use: conditional
Training: Pretrained

Created by

MistralAI

Enterprise AI solutions for trust and transparency.

Paris, France

Founded 2023

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · Mistral AI Studio

Providers(1)

Mistral AI Studio

View 1 provider route

Links

Website

About

Batch speech-to-text transcription model with speaker diarization. Public Mistral pricing is $0.003 per minute.

Voxtral Mini Transcribe 2 is a proprietary model in the Voxtral family. The structured metadata tracks a 33k-token context window, multimodal input, and audio. This page tracks provider routes through Mistral AI Studio. Headline tracked benchmarks include Artificial Analysis ASR WER 3.6.

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Mistral AI Studio	-	-	ServerlessPartial

Available via routers & gateways(10)

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + feeMistral AI Studio

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSMistral AI Studio

Martian

Router

AI-powered LLM router that analyzes each prompt in real-time to select the optimal model, targeting 20–97% cost reduction while maintaining quality; San Francisco startup reportedly nearing $1.3B valuation.

Passthrough + feeMistral AI Studio

Neutrino AI

Router

Commercial LLM router that dynamically routes each query to the best-suited model with load balancing and fallback handling, charging 3% of underlying AI spend.

Passthrough + feeMistral AI Studio

Not Diamond

Router

Predictive model router that determines the best LLM for each query; claims up to 25% accuracy gains and 10x cost reduction; powers OpenRouter's auto mode and is positioned specifically for coding agents.

Enterprise quoteMistral AI Studio

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughMistral AI Studio

Capabilities

MultimodalAudio

Benchmark peer barsfor Vision

No task-mapped benchmark peers are available for this model yet.

Benchmark scores(1)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.