Quick Start
- 1
- 2
- 3
Code Examples
About Deepgram API
Deepgram's API provides hosted speech-to-text, text-to-speech, and voice-agent audio models, including Nova, Flux, and Aura model families.
Pricing on Deepgram API
| Type | Rate |
|---|---|
| Audio minute | $0.004/min |
Capabilities
MultimodalAudio
About Deepgram Nova-2
Nova-2 is Deepgram's previous-generation flagship speech-to-text model, released September 2023. Delivers ~36% WER improvement over Whisper Large across tested domains (8.4% median WER), with improved entity recognition, punctuation, and capitalization. Supports 36+ languages and 10 domain-specific variants (general, meeting, phonecall, voicemail, finance, conversationalai, video, medical, drivethru, automotive). Batch: $0.0043/min; streaming: $0.0077/min. API ID: nova-2.
Model Specs
Released2023-09-19
Architectureneural