Using Qwen3.6-35B-A3B on Novita AI

Implementation guide · Qwen3.6 · Alibaba

ServerlessOpen Source

Quick Start

1
Create an account at Novita AI and generate an API key.
2
Use the Novita AI SDK or REST API to call qwen3.6-35b-a3b.
3
You'll be billed $0.25/1M input, $1.49/1M output tokens. See full pricing.

API Portal Pricing Model Card

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

View all models on Novita AI →

Pricing on Novita AI

Type	Price (per 1M)
Input tokens	$0.25
Output tokens	$1.49

Capabilities

VisionMultimodalFunction CallingTool Use

About Qwen3.6-35B-A3B

Qwen3.6-35B-A3B is an open-weight multimodal MoE model with 35B total parameters and 3B activated per token, released April 2026. It features a hybrid architecture combining Gated DeltaNet linear attention and standard Gated Attention with 256 total experts (8 routed + 1 shared), and includes a vision encoder for image and video understanding. Optimized for agentic coding, long-context reasoning, and visual tasks; supports 256K native context (extensible to ~1M via YaRN) with integrated thinking mode for multi-turn agent interactions.

Full model details →

Model Specs

Released2026-04-16

Parameters35B

Context262k

ArchitectureMixture of Experts