Qwen3-Coder-30B-A3B-Instruct

Name: Qwen3-Coder-30B-A3B-Instruct
Author: Alibaba

Released

2025-12-03

Last refreshed

2026-06-29

Status

Researched 16d ago

Open sourceCommercial use: permittedCodingRAGAgentsLong contextClassificationJSON / Tool use

Qwen3-Coder-30B-A3B-Instruct is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 262k context window
Buyers comparing 3 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Qwen3-Coder
Released: 2025-12-03
Context: 262k
Parameters: 30.5B total, 3.3B active
Architecture: Mixture of Experts
Specialization: code
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Available
Code: Unknown
Training: Pretrained

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$0.270

Input / 1M

$0.070

Cheapest of 3 routes · Novita AI

Providers(3)

AWS Bedrock Vercel AI Gateway Novita AI

View 3 provider routes

Links

Website HuggingFace

About

Qwen3-Coder-30B-A3B-Instruct is Alibaba's efficient open-source code generation model in the Qwen3-Coder family, released December 3, 2025 under the Apache 2.0 license. The model has 30.5 billion total parameters with 3.3 billion active per forward pass, organized across 48 transformer layers with 128 experts and 8 activated per token. It uses Grouped Query Attention (GQA) with 32 query heads and 4 key-value heads. Native context window is 262,144 tokens, extendable to 1 million tokens via YaRN. The model supports multi-turn tool calling, function calling, repository-level code understanding, and structured outputs. It is compatible with vLLM, SGLang, Ollama, LM Studio, llama.cpp, and HuggingFace Transformers. Available via AWS Bedrock, Novita AI, and Vercel AI Gateway.

Qwen3-Coder-30B-A3B-Instruct is an open-source model in the Qwen3-Coder family. The structured metadata tracks a 262k-token context window, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through AWS Bedrock, Vercel AI Gateway, and Novita AI, with the cheapest tracked route listed at $0.07 input and $0.27 output per 1M tokens. No headline benchmark score is tracked for Qwen3-Coder-30B-A3B-Instruct yet.

Top use-case fit: coding, agents, and build tasks

Coding

Included by capability and metadata signals in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Novita AI	$0.070	$0.270	Serverless
Vercel AI Gateway	$0.150	$0.600	Serverless
AWS Bedrock	$0.150	$0.620	Serverless

Available via routers & gateways(1)

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

PassthroughAWS Bedrock