EDITOR'S CHOICE
Research date unavailable
Whisper large-v3-turbo
OpenAI
Excellent
Low WER on messy, real-world audio.
Lowest WER on noisy real-world audio with the broadest language coverage; cheap to self-host.
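Since the verdict rests on WER (word error rate), here is a minimal sketch of how that metric is commonly computed: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the lowercase, whitespace-split normalization are assumptions for illustration, not the method used for this card's ratings.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference word count.

    Assumption: text is lowercased and split on whitespace; real
    benchmarks usually apply heavier normalization (punctuation, numerals).
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_error_rate("turn the lights off", "turn the light off"))  # 0.25
```

Lower is better, and a value above 1.0 is possible when the transcript contains many inserted words.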
The numbers
Pricing: see model page
Context: n/a (stt)
Pros
- Best accuracy on noisy audio
- 98+ languages
- Open weights
Cons
- Diarization needs add-ons