LLM Reference

Using Qwen3 Omni 30B A3B on Novita AI

Implementation guide · Qwen3 Omni · Alibaba

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Novita AI and generate an API key.
  2. 2
    Use the Novita AI SDK or REST API to call qwen3-omni-30b-a3b-thinking.
  3. 3
    You'll be billed $0.25/1M input, $0.97/1M output tokens. See full pricing.

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Pricing on Novita AI

TypePrice (per 1M)
Input tokens$0.25
Output tokens$0.97

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsAudio

About Qwen3 Omni 30B A3B

Qwen3 Omni 30B A3B is Alibaba's natively end-to-end omnimodal MoE model from the Qwen3 generation, capable of processing text, audio, images, and video while generating real-time streaming text and speech responses. Achieves SOTA on 22 of 36 audio/video benchmarks and open-source SOTA on 32 of 36. Available in Instruct and Thinking (reasoning) variants. Released September 22, 2025.

Model Specs

Released2025-09-22
Parameters30B total / 3B active
Context66K
Architecturemoe

Provider

Novita AI