Voxtral Mini Transcribe 2
Voxtral Mini Transcribe 2 is worth evaluating for vision when its provider route and context window match the workload.
Use it for
- Teams evaluating vision
- Workloads that can use a 33k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows

- Family: Voxtral
- Released: 2026-02-04
- Context: 33k
- Architecture: Decoder Only
- Specialization: speech-to-text
- Openness: Proprietary
- License: Proprietary (commercial use: conditional)
- Training: Pretrained
Cheapest of 1 route · Mistral AI Studio
About
Batch speech-to-text transcription model with speaker diarization. Public Mistral pricing is $0.003 per minute.
Voxtral Mini Transcribe 2 is a proprietary model in the Voxtral family. The structured metadata tracks a 33k-token context window and multimodal input that includes audio. This page tracks one provider route, through Mistral AI Studio. The headline tracked benchmark is Artificial Analysis ASR WER, with a score of 3.6.
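Because the public price is quoted per minute of audio rather than per token, a rough budget can be worked out from audio duration alone. The sketch below is a minimal estimate, assuming the $0.003 per minute rate above is billed pro rata with no minimum charge or rounding to whole minutes; that billing behavior is an assumption, and the function name `transcription_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Public Mistral price quoted on this page, in USD per minute of audio.
PRICE_PER_MINUTE_USD = Decimal("0.003")


def transcription_cost_usd(audio_seconds: float) -> Decimal:
    """Estimate transcription cost for a given audio duration.

    Assumption: billing is linear in duration (no per-request minimum,
    no rounding up to whole minutes). Check the provider's terms.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = Decimal(str(audio_seconds)) / Decimal(60)
    # Round to a hundredth of a cent for display purposes.
    return (minutes * PRICE_PER_MINUTE_USD).quantize(Decimal("0.0001"))


if __name__ == "__main__":
    for hours in (1, 100, 1000):
        print(f"{hours} h of audio: ${transcription_cost_usd(hours * 3600)}")
```

Under these assumptions, one hour of audio comes to about $0.18 and 1,000 hours to about $180. `Decimal` is used instead of floats so small per-minute amounts do not pick up binary rounding noise.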
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing across 1 provider for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Mistral AI Studio | - | - | Serverless · Partial |
Available via routers & gateways (10)
AIRouter
Router · Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
LiteLLM
Gateway · Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
Martian
Router · AI-powered LLM router that analyzes each prompt in real-time to select the optimal model, targeting 20–97% cost reduction while maintaining quality; San Francisco startup reportedly nearing $1.3B valuation.
Neutrino AI
Router · Commercial LLM router that dynamically routes each query to the best-suited model with load balancing and fallback handling, charging 3% of underlying AI spend.
Not Diamond
Router · Predictive model router that determines the best LLM for each query; claims up to 25% accuracy gains and 10x cost reduction; powers OpenRouter's auto mode and is positioned specifically for coding agents.
OpenRouter
Hybrid · Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Capabilities
Benchmark peer bars for Vision
No task-mapped benchmark peers are available for this model yet.
Benchmark scores (1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Artificial Analysis ASR WER | 3.6 | aa-wer | https://artificialanalysis.ai/speech-to-text |
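For readers unfamiliar with the metric, word error rate (WER) is conventionally the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words; lower is better. The sketch below illustrates that standard definition only. It is not Artificial Analysis's scoring pipeline, whose text normalization and datasets are not described on this page, and the function name `word_error_rate` is hypothetical. WER is usually reported as a percentage, but this page does not state the unit of the 3.6 score.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference words.

    Assumption: words are split on whitespace with no case folding or
    punctuation stripping; real benchmarks normalize text first.
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref_words)


if __name__ == "__main__":
    ref = "the cat sat on the mat"
    hyp = "the cat sat on mat"
    print(f"WER: {word_error_rate(ref, hyp):.3f}")
```

In the usage example, one of six reference words is dropped, so the WER is 1/6. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.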
Migration checks
No linked migration route is available for this model yet.