What is MOSS-TTS used for?

MOSS-TTS is used for audio and text to speech. The family description and listed model capabilities point to those workloads as the best fit.

How does MOSS-TTS compare to MOSS-Audio?

MOSS-TTS by MOSI AI is strongest where you need audio, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. MOSS-TTS has 1 listed variant, so compare the specs and pricing tables before choosing a production model.

Which MOSS-TTS model should I use?

If price is the main constraint, use the pricing table first because MOSS-TTS does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate MOSS-TTS-v1.5.

MOSS-TTS Models by MOSI AI

MOSI AIApache 2.0Open sourceOpen SourceAudio

1 model2026

Details

ResearcherMOSI AI

LicenseApache 2.0OSI-approved

Commercial useCommercial use: permitted

Models1

Released2026

Links

Website HuggingFace

About

MOSS-TTS is an open-source speech and sound generation model family from MOSI AI and the OpenMOSS team, designed for high-fidelity, high-expressiveness text-to-speech across complex real-world scenarios. The family covers stable long-form speech, multi-speaker dialogue, voice and character design, environmental sound effects, and real-time streaming TTS. Models include MOSS-TTS v1.0, MOSS-TTS-v1.5, MOSS-TTS-Nano, MOSS-SoundEffect, MOSS-TTSD, and MOSS-VoiceGenerator.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view

MOSS-TTS-v1.5Current

Use when the workload needs text to speech, 8B parameters, and audio.

2026-05text to speech8B parametersaudio

Current MOSS-TTS variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
MOSS-TTS-v1.5	Use when the workload needs text to speech, 8B parameters, and audio.	2026-05	text to speech8B parametersaudio	Current

Release Timeline

1 release group

2026-05

1 current

MOSS-TTS-v1.5

text to speech8B parametersaudio

Current

Specifications(1 models)

MOSS-TTS model specifications comparison
Model	Released	Parameters
MOSS-TTS-v1.5	2026-05	8B

Frequently Asked Questions

What is MOSS-TTS used for?: MOSS-TTS is used for audio and text to speech. The family description and listed model capabilities point to those workloads as the best fit.
How does MOSS-TTS compare to MOSS-Audio?: MOSS-TTS by MOSI AI is strongest where you need audio, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. MOSS-TTS has 1 listed variant, so compare the specs and pricing tables before choosing a production model.
Which MOSS-TTS model should I use?: If price is the main constraint, use the pricing table first because MOSS-TTS does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate MOSS-TTS-v1.5.

Models(1)