LLM Reference

Using Qwen3-Coder-Next on Vercel AI Gateway

Implementation guide · Qwen3-Coder · Alibaba

Serverless

Quick Start

  1. 1
    Create an account at Vercel AI Gateway and generate an API key.
  2. 2
    Use the Vercel AI Gateway SDK or REST API to call alibaba/qwen3-coder-next — see the documentation for request format.
  3. 3
    You'll be billed $0.50/1M input, $1.20/1M output tokens. See full pricing.

Code Examples

Install
pip install openai
API key
AI_GATEWAY_API_KEY
Model ID
alibaba/qwen3-coder-next

creator/model-name e.g. kwaipilot/kat-coder-pro-v2

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1"
)
response = client.chat.completions.create(
    model="alibaba/qwen3-coder-next",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

About Vercel AI Gateway

Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK. API details: API key via Authorization: Bearer <AI_GATEWAY_API_KEY>. Key from Vercel dashboard. Free $5/month credit; paid tier is provider list price with zero markup. BYOK (bring-your-own-key) also supported with no markup or fee. Model IDs use {provider-owner}/{model-name} — e.g., anthropic/claude-opus-4.6, openai/gpt-5.

Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK.

Pricing on Vercel AI Gateway

TypePrice (per 1M)
Input tokens$0.50
Output tokens$1.20

Capabilities

ReasoningFunction CallingTool UseStructured OutputsCode Execution

About Qwen3-Coder-Next

Qwen3-Coder-Next is an ultra-sparse Mixture-of-Experts coding agent model from Alibaba's Qwen team, released February 3, 2026 under Apache 2.0. It has 80B total parameters with 3B active at inference, delivering substantially higher throughput than comparable dense models. It supports a native 256K context window, function calling, structured outputs, Claude Code, Qwen Code, Cline, Kilo, and other scaffold templates. Benchmarks reported in the DAT-3724 datapack include SWE-Bench Pro 44.3%, SWE-Bench Resolved 70.6%, and TerminalBench 2 36.2%.

Model Specs

Released2026-02-03
Parameters80B total, 3B active
Context256K
Architecturemoe

Provider

Vercel AI Gateway

Vercel