Higgs Audio Models by Boson AI
1 model · 2026 · Up to 8k ctx
Details
Researcher: Boson AI
License: Noncommercial
Commercial use: Non-commercial only
Models: 1
Released: 2026
Max context: 8k
Higgs Audio is Boson AI's family of text-audio foundation models, spanning text-to-speech (v3 TTS) and speech-to-text (v3 STT) variants. The models are designed for voice agents, with zero-shot voice cloning, support for 100+ languages, and inline control over emotion, style, prosody, and sound effects.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Higgs Audio v3 TTS (Current)
Use when the workload needs text to speech, 8k context, and 4B parameters.
2026-06 · text to speech · 8k context · 4B parameters
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Higgs Audio v3 TTS | Use when the workload needs text to speech, 8k context, and 4B parameters. | 2026-06 | text to speech, 8k context, 4B parameters | Current |
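As a rough illustration of how a "Use when" sentence like the one in this section could be assembled from a model's listed fields, here is a minimal Python sketch. The `ModelRecord` type, its field names, and the `use_when` function are hypothetical and are not taken from the catalog's actual implementation; the sketch only reproduces the sentence pattern visible in the table.

```python
from dataclasses import dataclass


@dataclass
class ModelRecord:
    """Hypothetical record holding the fields shown in the table."""
    name: str
    capabilities: list[str]  # e.g. ["text to speech"]
    context: str             # e.g. "8k"
    parameters: str          # e.g. "4B"


def join_signals(signals: list[str]) -> str:
    """Join items as 'a', 'a and b', or 'a, b, and c'."""
    if len(signals) == 1:
        return signals[0]
    if len(signals) == 2:
        return f"{signals[0]} and {signals[1]}"
    return ", ".join(signals[:-1]) + f", and {signals[-1]}"


def use_when(model: ModelRecord) -> str:
    """Build a use-when sentence from capability, context, and size signals."""
    signals = [
        *model.capabilities,
        f"{model.context} context",
        f"{model.parameters} parameters",
    ]
    return f"Use when the workload needs {join_signals(signals)}."


# Values copied from the table row for Higgs Audio v3 TTS.
higgs_tts = ModelRecord(
    name="Higgs Audio v3 TTS",
    capabilities=["text to speech"],
    context="8k",
    parameters="4B",
)
print(use_when(higgs_tts))
```

Release and replacement fields, which the guidance note also mentions, are left out of this sketch because the page does not show how they affect the wording.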
Release Timeline
1 release group · 2026-06
1 current
Higgs Audio v3 TTS
Current · text to speech · 8k context · 4B parameters
Specifications (1 model)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Higgs Audio v3 TTS | 2026-06 | 8k | 4B |
Available From (1 provider)
Frequently Asked Questions
- What is Higgs Audio used for?
- Higgs Audio is used for text to speech and agent workflows. The family description and listed model capabilities point to those workloads as the best fit.
- How does Higgs Audio compare to Claude 3?
- Higgs Audio by Boson AI is strongest where you need text to speech, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Higgs Audio has 1 listed variant and reaches up to 8k context, while Claude 3 reaches up to 200k context. Compare the specs and pricing tables before choosing a production model.
- Which Higgs Audio model should I use?
- If price is the main constraint, check the pricing table first, since provider pricing for Higgs Audio is incomplete in the local data. For the most capable and most recent listed option, evaluate Higgs Audio v3 TTS with 8k context.