Quick Start
- 1
- 2Use the Novita AI SDK or REST API to call
qwen3.6-35b-a3b. - 3
Code Examples
Code examples for this provider have not been sourced yet.
About Novita AI
Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Pricing on Novita AI
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.25 |
| Output tokens | $1.49 |
Capabilities
MultimodalFunction CallingTool Use
About Qwen3.6-35B-A3B
Qwen3.6-35B-A3B is an open-weight multimodal MoE model with 35B total parameters and 3B activated per token, released April 2026. It features a hybrid architecture combining Gated DeltaNet linear attention and standard Gated Attention with 256 total experts (8 routed + 1 shared), and includes a vision encoder for image and video understanding. Optimized for agentic coding, long-context reasoning, and visual tasks; supports 256K native context (extensible to ~1M via YaRN) with integrated thinking mode for multi-turn agent interactions.
Model Specs
Released2026-04-16
Parameters35B
Context262K
Architecturemoe