Using Qwen3 Omni 30B A3B on Novita AI
Implementation guide · Qwen3 Omni · Alibaba
ServerlessOpen Source
Quick Start
- 1
- 2Use the Novita AI SDK or REST API to call
qwen3-omni-30b-a3b-thinking. - 3
Code Examples
Code examples for this provider have not been sourced yet.
About Novita AI
Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Pricing on Novita AI
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.25 |
| Output tokens | $0.97 |
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsAudio
About Qwen3 Omni 30B A3B
Qwen3 Omni 30B A3B is Alibaba's natively end-to-end omnimodal MoE model from the Qwen3 generation, capable of processing text, audio, images, and video while generating real-time streaming text and speech responses. Achieves SOTA on 22 of 36 audio/video benchmarks and open-source SOTA on 32 of 36. Available in Instruct and Thinking (reasoning) variants. Released September 22, 2025.
Model Specs
Released2025-09-22
Parameters30B total / 3B active
Context66K
Architecturemoe