Using Higgs Audio v3 TTS on Boson AI API
Implementation guide · Higgs Audio · Boson AI
Serverless · Open Weights
Quick Start
- 1
- 2 Use the Boson AI API SDK or REST API to call higgs-audio-v3-tts; see the documentation for the request format.
Code Examples
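This page does not give the request format, so the following is only a minimal sketch of how a REST call to higgs-audio-v3-tts might be assembled. The endpoint URL is a placeholder, and the JSON field names ("model", "input", "response_format") and the bearer-token header are assumptions made for illustration, not the documented Boson AI schema. Only the model name and the MP3/PCM output formats come from this page. The request is built but not sent.

```python
import json
import urllib.request

# Placeholder endpoint (assumption): replace with the URL from the Boson AI docs.
API_URL = "https://api.example.invalid/v1/audio/speech"
# Model name as given on this page.
MODEL_ID = "higgs-audio-v3-tts"


def build_tts_request(text: str, api_key: str, audio_format: str = "mp3") -> urllib.request.Request:
    """Build (but do not send) a POST request for speech synthesis.

    The JSON field names and the Authorization header scheme are
    hypothetical; check the official documentation before use.
    """
    if audio_format not in ("mp3", "pcm"):
        raise ValueError("audio_format must be 'mp3' or 'pcm'")
    payload = {
        "model": MODEL_ID,
        "input": text,                     # hypothetical field name
        "response_format": audio_format,   # hypothetical field name
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_tts_request("Hello from Higgs Audio.", api_key="YOUR_API_KEY")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # Once the real endpoint and schema are confirmed, the request could be sent with:
    # with urllib.request.urlopen(req) as resp:
    #     audio_bytes = resp.read()
```

Keeping request construction separate from sending makes it easy to swap in the real field names later and to inspect the payload without network access.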
About Boson AI API
Boson AI API is Boson AI's hosted API for the Higgs Audio TTS and STT models. It is currently in free public preview with rate limits; commercial licensing terms have not yet been disclosed.
Pricing on Boson AI API
Capabilities
Audio
About Higgs Audio v3 TTS
Higgs Audio v3 TTS is Boson AI's 4B-parameter text-to-speech model, released June 4, 2026. It supports 102 languages (85 at production quality, with WER/CER below 5%), zero-shot voice cloning, and inline control tokens for emotion (21 types), style (singing, shouting, whispering), sound effects, and prosody. Audio output is 24 kHz MP3 or PCM. Open weights are available under a non-commercial license; the hosted API is in free public preview.
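Raw PCM output has no header, so most players cannot open it directly. The sketch below wraps a PCM byte string in a WAV container using Python's standard `wave` module. The 24 kHz sample rate comes from the description above; 16-bit samples and a single (mono) channel are assumptions, since this page does not state the sample width or channel count.

```python
import io
import wave

SAMPLE_RATE_HZ = 24_000   # stated on this page
SAMPLE_WIDTH_BYTES = 2    # assumption: 16-bit samples
CHANNELS = 1              # assumption: mono


def pcm_to_wav_bytes(pcm: bytes) -> bytes:
    """Wrap raw PCM samples in a WAV container and return the file bytes."""
    frame_size = SAMPLE_WIDTH_BYTES * CHANNELS
    if len(pcm) % frame_size != 0:
        raise ValueError("PCM length is not a whole number of frames")
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(CHANNELS)
        wav_file.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav_file.setframerate(SAMPLE_RATE_HZ)
        wav_file.writeframes(pcm)
    return buffer.getvalue()


def pcm_duration_seconds(pcm: bytes) -> float:
    """Return the playback length implied by the byte count."""
    return len(pcm) / (SAMPLE_RATE_HZ * SAMPLE_WIDTH_BYTES * CHANNELS)


if __name__ == "__main__":
    one_second_of_silence = b"\x00\x00" * SAMPLE_RATE_HZ
    wav_bytes = pcm_to_wav_bytes(one_second_of_silence)
    print(len(wav_bytes), pcm_duration_seconds(one_second_of_silence))
```

If the actual output uses a different sample width or channel count, only the two assumed constants need to change.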
Model Specs
Released: 2026-06-04
Parameters: 4B
Context: 8k
Architecture: Decoder-only