LLM Reference

Using DeepSeek V4 Flash on Vercel AI Gateway

Implementation guide · DeepSeek V4 · DeepSeek

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Vercel AI Gateway and generate an API key.
  2. 2
    Use the Vercel AI Gateway SDK or REST API to call deepseek/deepseek-v4-flash — see the documentation for request format.
  3. 3
    You'll be billed $0.14/1M input, $0.28/1M output tokens. See full pricing.

Code Examples

Install
pip install openai
API key
AI_GATEWAY_API_KEY
Model ID
deepseek/deepseek-v4-flash

creator/model-name e.g. kwaipilot/kat-coder-pro-v2

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1"
)
response = client.chat.completions.create(
    model="deepseek/deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

About Vercel AI Gateway

Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK. API details: API key via Authorization: Bearer <AI_GATEWAY_API_KEY>. Key from Vercel dashboard. Free $5/month credit; paid tier is provider list price with zero markup. BYOK (bring-your-own-key) also supported with no markup or fee. Model IDs use {provider-owner}/{model-name} — e.g., anthropic/claude-opus-4.6, openai/gpt-5.

Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more. Pricing is pass-through at provider list rates with zero markup. Includes $5/month free tier; paid is pay-as-you-go. Features: automatic provider fallbacks, unified observability, streaming, tool use, vision, embeddings, image/video generation, BYOK mode. Integrates via @ai-sdk/gateway package or plain model ID strings in Vercel AI SDK.

Pricing on Vercel AI Gateway

TypePrice (per 1M)
Input tokens$0.14
Output tokens$0.28

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

About DeepSeek V4 Flash

DeepSeek V4 Flash is a 284B parameter (13B activated) Mixture-of-Experts language model with 1M-token context. Features a hybrid attention architecture combining Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) for efficient long-context inference. Supports thinking and non-thinking modes. Legacy API aliases deepseek-chat and deepseek-reasoner map to this model's non-thinking and thinking modes respectively. Pricing: $0.14/1M input, $0.28/1M output (cache hit: $0.0028/1M input). MIT licensed.

Model Specs

Released2026-04-24
Parameters284B
Context1M
ArchitectureMixture of Experts

Provider

Vercel AI Gateway

Vercel