Voxtral TTS
voxtral-tts-2603
ProprietaryMultimodal
About
Text-to-speech model with zero-shot voice cloning, multilingual output, and real-time streaming support.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning