Using DeepSeek V4 Pro on Vercel AI Gateway
Implementation guide · DeepSeek V4 · DeepSeek
Quick Start
- 1
- 2Use the Vercel AI Gateway SDK or REST API to call
deepseek/deepseek-v4-pro— see the documentation for request format. - 3
Code Examples
pip install openaiAI_GATEWAY_API_KEYdeepseek/deepseek-v4-procreator/model-name e.g. kwaipilot/kat-coder-pro-v2
import os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["AI_GATEWAY_API_KEY"],
base_url="https://ai-gateway.vercel.sh/v1"
)
response = client.chat.completions.create(
model="deepseek/deepseek-v4-pro",
messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)About Vercel AI Gateway
Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK. API details: API key via Authorization: Bearer <AI_GATEWAY_API_KEY>. Key from Vercel dashboard. Free $5/month credit; paid tier is provider list price with zero markup. BYOK (bring-your-own-key) also supported with no markup or fee. Model IDs use {provider-owner}/{model-name} — e.g., anthropic/claude-opus-4.6, openai/gpt-5.
Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK.
Pricing on Vercel AI Gateway
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.43 |
| Output tokens | $0.87 |
Capabilities
About DeepSeek V4 Pro
DeepSeek V4 Pro is DeepSeek's flagship open-weights model, released April 24 2026 under the MIT license. Architecture: 1.6T total / 49B active parameters, MoE with Compressed Sparse Attention (CSA) + Heavily Compressed Attention (HCA) hybrid — requiring only 27% of inference FLOPs vs standard 1M-context transformers — plus Manifold-Constrained Hyper-Connections (mHC) and Muon Optimizer. Context window: 1,000,000 tokens; max output: 384,000 tokens (Think Max mode requires ≥384K context). Text-only (no vision/image input). Supports three reasoning modes: Non-Think, Think High, Think Max. Function calling, tool use, and structured outputs supported. Key benchmarks: SWE-bench Verified 80.6%, SWE-bench Pro 55.4%, LiveCodeBench 93.5%, GPQA Diamond 90.1%, MMLU-Pro 87.5%, Terminal-Bench 2.0 67.9%, Chatbot Arena 1460 (2026-04-28). Current API pricing: $0.435/$0.87 per 1M input/output tokens (75% discount active until 2026-05-31 15:59 UTC); regular rate $1.74/$3.48.