LLM Reference

MAI-Transcribe-1

Proprietary

About

Enterprise-grade speech recognition model delivering accurate transcription across 25 languages at approximately 50% lower GPU cost than alternatives. Powers Copilot's Voice Mode transcriptions. Use cases include conversational AI, agent assist, live captioning, accessibility, media subtitling, and education platforms.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

Rankings

Specifications

FamilyMAI
Released2026-04-02
ArchitectureTransformer
SpecializationSpeech Recognition
TrainingSupervised learning on multilingual speech data

Created by

Advancing the state-of-the-art in AI and computing.

Redmond, Washington, United States
Founded 1991
Website