LLM Reference

Using Qwen3.5-35B-A3B on Novita AI

Implementation guide · Qwen3.5 · Alibaba

Serverless

Quick Start

  1. 1
    Create an account at Novita AI and generate an API key.
  2. 2
    Use the Novita AI SDK or REST API to call qwen3.5-35b-a3b.
  3. 3
    You'll be billed $0.25/1M input, $2.00/1M output tokens. See full pricing.

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Pricing on Novita AI

TypePrice (per 1M)
Input tokens$0.25
Output tokens$2.00

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

About Qwen3.5-35B-A3B

Alibaba's Qwen3.5-35B-A3B is a Mixture-of-Experts model released February 24, 2026, with 35B total parameters and 3B active during inference. Part of the Qwen3.5 series with a 262K native context window (extendable to ~1M tokens). Optimized for high inference throughput (78+ tokens/second on NVIDIA hardware). Open-source under Apache 2.0.

Model Specs

Released2026-02-24
Parameters35B
Context262K
ArchitectureMixture of Experts

Provider

Novita AI