LLM Reference

Transcribe (03-2026)

Open SourceMultimodal

About

Cohere's state-of-the-art automatic speech recognition (ASR) model. Transcribe is a 2B parameter Conformer-based encoder-decoder model trained from scratch for high-fidelity transcription across 14 languages: English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Chinese (Mandarin), Japanese, Korean, Vietnamese, and Arabic. Can process 525 minutes of audio per minute. Achieves 5.42 WER on Hugging Face Open ASR leaderboard.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

Rankings

Specifications

Released2026-03-01
Parameters2B
Architectureconformer
Specializationspeech-to-textaudio
LicenseApache 2.0