LLM ReferenceLLM Reference

Transcribe (03-2026)

cohere-transcribe-03-2026

Researched 137d ago

Last refreshed 2026-04-18. Next refresh: weekly.

Open SourceMultimodalVision

Transcribe (03-2026) has model metadata, but missing tracked provider pricing keeps it from being a default production pick.

Decision context: Vision task fit, 0 tracked provider routes, and research from 2026-01-01.

Use it for

  • Teams evaluating vision

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Strict JSON or tool-calling flows
  • Teams that need a tracked hosted API route today

Cheapest output

-

No tracked output price

Provider routes

0

No provider route in seed

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-01-01

Researched 137d ago

stale

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Benchmark peer barsfor Vision

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Cohere's state-of-the-art automatic speech recognition (ASR) model. Transcribe is a 2B parameter Conformer-based encoder-decoder model trained from scratch for high-fidelity transcription across 14 languages: English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Chinese (Mandarin), Japanese, Korean, Vietnamese, and Arabic. Can process 525 minutes of audio per minute. Achieves 5.42 WER on Hugging Face Open ASR leaderboard.

Capabilities

Multimodal

Rankings

Specifications

Released2026-03-01
Parameters2B
Architectureconformer
Specializationaudio
LicenseApache 2.0

Created by

Empowering developers with advanced language AI.

Toronto, Ontario, Canada
Founded 2022
Website