Quick Start
- 1
- 2Use the ElevenLabs API SDK or REST API to call
scribe_v2— see the documentation for request format.
Code Examples
About ElevenLabs API
ElevenLabs' API provides hosted text-to-speech, speech-to-speech, dubbing, and voice models for creative production and realtime conversational applications.
Pricing on ElevenLabs API
Capabilities
Audio
About Scribe v2
Scribe v2 is ElevenLabs' current state-of-the-art batch speech-to-text model, released January 12, 2026. Improvements over v1 for long-form audio, extended silences, and tone changes. Supports 90+ languages, word-level timestamps, 32-speaker diarization, 56 entity types, and keyterm prompting (up to 1,000 terms). Base pricing: $0.22/hr; entity detection add-on: $0.07/hr; keyterm prompting add-on: $0.05/hr. API ID: scribe_v2.
Model Specs
Released2026-01-12
Architectureneural