MOSS-TTS-v1.5
MOSS-TTS-v1.5 has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- MOSS-TTS
- Released
- 2026-05-26
- Parameters
- 8B
- Architecture
- MossTTSDelay
- Specialization
- text-to-speech
- Openness
- Open source
- License
- Apache 2.0(OSI)Commercial use allowed
- Training
- pretrained
No tracked provider token pricing is available yet.
About
MOSS-TTS-v1.5 is an open-weight multilingual text-to-speech model from MOSI AI and the OpenMOSS team. The 8B-parameter MossTTSDelay model supports zero-shot voice cloning, long-form speech generation, explicit pause control with [pause X.Ys] markers, and language-tagged multilingual synthesis across 31 languages. Version 1.5 improves on MOSS-TTS v1.0 with stronger multilingual synthesis, more stable voice cloning, better long-reference short-text handling, and punctuation-driven prosody. The model weights are available on Hugging Face under Apache 2.0; no hosted token-priced API route is confirmed in the June 2026 research handoff.
MOSS-TTS-v1.5 is an open-source model in the MOSS-TTS family. The structured metadata tracks audio. No headline benchmark score is tracked for MOSS-TTS-v1.5 yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
API versions
MOSS-TTS-v1.5No tracked provider token pricing is available yet.