MAI-Transcribe-1.5
MAI-Transcribe-1.5 is worth evaluating for vision when its provider route and context window match the workload.
Use it for
- Teams evaluating vision
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows
- Family
- MAI
- Released
- 2026-06-02
- Architecture
- transformer
- Specialization
- speech-recognition
- License
- Proprietary
- Training
- finetuned
Cheapest of 1 route · Microsoft Foundry
About
MAI-Transcribe-1.5 is Microsoft AI's second-generation speech-to-text transcription model. It supports 43 languages, domain-specific terminology recognition, and Microsoft-reported five-times-faster transcription than competing models while maintaining state-of-the-art accuracy. Streaming support was announced as coming soon at launch.
MAI-Transcribe-1.5 is a proprietary model in the MAI family. The structured metadata tracks multimodal input and audio. This page tracks provider routes through Microsoft Foundry. No headline benchmark score is tracked for MAI-Transcribe-1.5 yet.
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Microsoft Foundry | - | - | ServerlessPartial |
Capabilities
Benchmark peer barsfor Vision
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.