MOSS-TTS Models by MOSI AI
1 model2026
Details
ResearcherMOSI AI
LicenseApache 2.0(OSI)
Commercial useCommercial use allowed
Models1
Released2026
About
MOSS-TTS is an open-source speech and sound generation model family from MOSI AI and the OpenMOSS team, designed for high-fidelity, high-expressiveness text-to-speech across complex real-world scenarios. The family covers stable long-form speech, multi-speaker dialogue, voice and character design, environmental sound effects, and real-time streaming TTS. Models include MOSS-TTS v1.0, MOSS-TTS-v1.5, MOSS-TTS-Nano, MOSS-SoundEffect, MOSS-TTSD, and MOSS-VoiceGenerator.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
1 in view
MOSS-TTS-v1.5Current
Use when the workload needs text to speech, 8B parameters, and audio.
2026-05text to speech8B parametersaudio
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| MOSS-TTS-v1.5 | Use when the workload needs text to speech, 8B parameters, and audio. | 2026-05 | text to speech8B parametersaudio | Current |
Release Timeline
1 release group2026-05
1 current
MOSS-TTS-v1.5
Currenttext to speech8B parametersaudio
Specifications(1 models)
| Model | Released | Parameters |
|---|---|---|
| MOSS-TTS-v1.5 | 2026-05 | 8B |
Frequently Asked Questions
- What is MOSS-TTS used for?
- MOSS-TTS is used for audio and text to speech. The family description and listed model capabilities point to those workloads as the best fit.
- How does MOSS-TTS compare to MOSS-Audio?
- MOSS-TTS by MOSI AI is strongest where you need audio, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. MOSS-TTS has 1 listed variant, so compare the specs and pricing tables before choosing a production model.
- Which MOSS-TTS model should I use?
- If price is the main constraint, use the pricing table first because MOSS-TTS does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate MOSS-TTS-v1.5.