LLM Reference

Using Qwen3.6-35B-A3B on Novita AI

Implementation guide · Qwen3.6 · Alibaba

Serverless

Quick Start

  1. 1
    Create an account at Novita AI and generate an API key.
  2. 2
    Use the Novita AI SDK or REST API to call qwen3.6-35b-a3b.
  3. 3
    You'll be billed $0.25/1M input, $1.49/1M output tokens. See full pricing.

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Pricing on Novita AI

TypePrice (per 1M)
Input tokens$0.25
Output tokens$1.49

Capabilities

MultimodalFunction CallingTool Use

About Qwen3.6-35B-A3B

Qwen3.6-35B-A3B is an open-weight multimodal MoE model with 35B total parameters and 3B activated per token, released April 2026. It features a hybrid architecture combining Gated DeltaNet linear attention and standard Gated Attention with 256 total experts (8 routed + 1 shared), and includes a vision encoder for image and video understanding. Optimized for agentic coding, long-context reasoning, and visual tasks; supports 256K native context (extensible to ~1M via YaRN) with integrated thinking mode for multi-turn agent interactions.

Model Specs

Released2026-04-16
Parameters35B
Context262K
Architecturemoe

Provider

Novita AI