LLM Reference

Qwen3-Coder-30B-A3B-Instruct

Released
2025-12-03
Last refreshed
2026-06-29
Status
Researched 16d ago
Open sourceCommercial use: permittedCodingRAGAgentsLong contextClassificationJSON / Tool use

Qwen3-Coder-30B-A3B-Instruct is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 262k context window
  • Buyers comparing 3 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
Specifications
Released
2025-12-03
Context
262k
Parameters
30.5B total, 3.3B active
Architecture
Mixture of Experts
Specialization
code
Openness
Open source
License
Apache 2.0OSI-approvedCommercial use: permitted
Weights
Available
Code
Unknown
Training
Pretrained
Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website
Pricing
Output / 1M
$0.270
Input / 1M
$0.070

Cheapest of 3 routes · Novita AI

About

Qwen3-Coder-30B-A3B-Instruct is Alibaba's efficient open-source code generation model in the Qwen3-Coder family, released December 3, 2025 under the Apache 2.0 license. The model has 30.5 billion total parameters with 3.3 billion active per forward pass, organized across 48 transformer layers with 128 experts and 8 activated per token. It uses Grouped Query Attention (GQA) with 32 query heads and 4 key-value heads. Native context window is 262,144 tokens, extendable to 1 million tokens via YaRN. The model supports multi-turn tool calling, function calling, repository-level code understanding, and structured outputs. It is compatible with vLLM, SGLang, Ollama, LM Studio, llama.cpp, and HuggingFace Transformers. Available via AWS Bedrock, Novita AI, and Vercel AI Gateway.

Qwen3-Coder-30B-A3B-Instruct is an open-source model in the Qwen3-Coder family. The structured metadata tracks a 262k-token context window, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through AWS Bedrock, Vercel AI Gateway, and Novita AI, with the cheapest tracked route listed at $0.07 input and $0.27 output per 1M tokens. No headline benchmark score is tracked for Qwen3-Coder-30B-A3B-Instruct yet.

Top use-case fit: coding, agents, and build tasks

Coding

Included by capability and metadata signals in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Novita AI$0.070$0.270
Serverless
Vercel AI Gateway$0.150$0.600
Serverless
AWS Bedrock$0.150$0.620
Serverless

Available via routers & gateways(1)

Capabilities

Function CallingTool UseStructured OutputsCode Execution

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Qwen3-Coder-30B-A3B-Instruct?

Qwen3-Coder-30B-A3B-Instruct has a context window of 262k tokens.

How much does Qwen3-Coder-30B-A3B-Instruct cost?

Qwen3-Coder-30B-A3B-Instruct pricing ranges from $0.07/1M to $0.15/1M input tokens depending on the provider.

When was Qwen3-Coder-30B-A3B-Instruct released?

Qwen3-Coder-30B-A3B-Instruct was released on 2025-12-03.

Which providers offer Qwen3-Coder-30B-A3B-Instruct?

Qwen3-Coder-30B-A3B-Instruct is available from 3 providers: AWS Bedrock, Vercel AI Gateway, Novita AI.