LLM Reference
Fireworks AI

DeepSeek V4 Pro on Fireworks AI

DeepSeek V4 · DeepSeek

ServerlessOpen Source

Last refreshed 2026-06-13. Next refresh: weekly.

Why use DeepSeek V4 Pro on Fireworks AI?

Fireworks AI offers DeepSeek V4 Pro with pay-as-you-go pricing at $1.74/1M input tokens. Fireworks AI offers a generative AI platform as a service, focusing on rapid product iteration and cost-efficient AI deployment.

Compare DeepSeek V4 Pro across 5 providers to find the best fit for your use case
Input / 1M
$1.74
Output / 1M
$3.48
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export FIREWORKS_API_KEY=...
Call
import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],
Model ID
deepseek-v4-pro

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],
    base_url="https://api.fireworks.ai/inference/v1"
)
response = client.chat.completions.create(
    model="deepseek-v4-pro",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • Fireworks model IDs use "accounts/fireworks/models/{model-name}" format, e.g. "accounts/fireworks/models/llama4-scout-instruct-basic" or "accounts/fireworks/models/deepseek-r1".
  • The examples expect FIREWORKS_API_KEY; rename it only if your application config maps the new variable.

Compare DeepSeek V4 Pro Across Providers

ProviderInput (per 1M)Output (per 1M)
DeepSeek Platform$0.43$0.87
Fireworks AI$1.74$3.48
OpenRouter$0.44$0.87
Vercel AI Gateway$0.43$0.87
Novita AI$1.64$3.38

Pricing

TypePrice (per 1M)
Input tokens$1.74
Output tokens$3.48

Capabilities

ReasoningFunction CallingTool UseStructured OutputsPrompt Caching

About DeepSeek V4 Pro

DeepSeek V4 Pro is DeepSeek's flagship open-weights model, released April 24 2026 under the MIT license. Architecture: 1.6T total / 49B active parameters, MoE with Compressed Sparse Attention (CSA) + Heavily Compressed Attention (HCA) hybrid — requiring only 27% of inference FLOPs vs standard 1M-context transformers — plus Manifold-Constrained Hyper-Connections (mHC) and Muon Optimizer. Context window: 1,000,000 tokens; max output: 384,000 tokens (Think Max mode requires ≥384K context). Text-only (no vision/image input). Supports three reasoning modes: Non-Think, Think High, Think Max. Function calling, tool use, and structured outputs supported. Key benchmarks: SWE-bench Verified 80.6%, SWE-bench Pro 55.4%, LiveCodeBench 93.5%, GPQA Diamond 90.1%, MMLU-Pro 87.5%, Terminal-Bench 2.0 67.9%, Chatbot Arena 1460 (2026-04-28). Current API pricing: $0.435/$0.87 per 1M input/output tokens; DeepSeek made the former 75% promotional rate permanent effective 2026-05-31 15:59 UTC.

FAQ

What does DeepSeek V4 Pro cost on Fireworks AI?

On Fireworks AI, DeepSeek V4 Pro costs $1.74 per 1M input tokens and $3.48 per 1M output tokens.

What is the context window for DeepSeek V4 Pro on Fireworks AI?

DeepSeek V4 Pro supports a 1m token context window on Fireworks AI.

How does Fireworks AI compare to other DeepSeek V4 Pro providers?

DeepSeek V4 Pro is available from 5 providers. The cheapest input pricing is $0.435/1M tokens from DeepSeek Platform.

Who created DeepSeek V4 Pro?

DeepSeek V4 Pro was created by DeepSeek as part of the DeepSeek V4 model family.

Is DeepSeek V4 Pro open source?

DeepSeek V4 Pro is open source under MIT according to the seed data.

Get Started