Transcribe (03-2026)
Open Source, Multimodal
About
Cohere's state-of-the-art automatic speech recognition (ASR) model. Transcribe is a 2B-parameter, Conformer-based encoder-decoder model trained from scratch for high-fidelity transcription across 14 languages: English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Chinese (Mandarin), Japanese, Korean, Vietnamese, and Arabic. It can process 525 minutes of audio per minute of processing time and achieves a 5.42 WER on the Hugging Face Open ASR leaderboard.
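To make the two headline figures concrete, here is a minimal Python sketch of how a word error rate (WER) is commonly computed and what a 525x throughput implies for turnaround time. It is an illustration, not the leaderboard's scoring code: the function names `word_error_rate` and `processing_seconds` are hypothetical, the text is split on whitespace with no normalization (leaderboards typically normalize casing and punctuation first), and it assumes the 5.42 figure is a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the word-level edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def processing_seconds(audio_minutes: float, speedup: float = 525.0) -> float:
    """Seconds needed to transcribe `audio_minutes` at `speedup` times real time.

    The default of 525 is the throughput quoted above; actual speed depends
    on hardware and batching, which the source does not specify.
    """
    return audio_minutes * 60.0 / speedup


if __name__ == "__main__":
    wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
    print(f"WER: {wer:.3f}")
    print(f"One hour of audio: {processing_seconds(60):.2f} s")
```

In the example, one substituted word out of six reference words gives a WER of 1/6, or about 16.7%. At the quoted throughput, one hour of audio takes 3600 / 525 seconds, a little under 7 seconds.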
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution
Specifications
Family: Transcribe
Released: 2026-03-01
Parameters: 2B
Architecture: Conformer
Specialization: speech-to-text, audio
License: Apache 2.0