Last refreshed 2026-06-13. Next refresh: weekly.
Why use DeepSeek V4 Pro on Fireworks AI?
Fireworks AI offers DeepSeek V4 Pro with pay-as-you-go pricing at $1.74/1M input tokens. Fireworks AI offers a generative AI platform as a service, focusing on rapid product iteration and cost-efficient AI deployment.
Compare DeepSeek V4 Pro across 5 providers to find the best fit for your use caseSetup recipe
Python + curlpip install openaiexport FIREWORKS_API_KEY=...import os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["FIREWORKS_API_KEY"],deepseek-v4-proRequest example
import os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["FIREWORKS_API_KEY"],
base_url="https://api.fireworks.ai/inference/v1"
)
response = client.chat.completions.create(
model="deepseek-v4-pro",
messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)Gotchas
- Fireworks model IDs use "accounts/fireworks/models/{model-name}" format, e.g. "accounts/fireworks/models/llama4-scout-instruct-basic" or "accounts/fireworks/models/deepseek-r1".
- The examples expect FIREWORKS_API_KEY; rename it only if your application config maps the new variable.
Compare DeepSeek V4 Pro Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| DeepSeek Platform | $0.43 | $0.87 |
| Fireworks AI | $1.74 | $3.48 |
| OpenRouter | $0.44 | $0.87 |
| Vercel AI Gateway | $0.43 | $0.87 |
| Novita AI | $1.64 | $3.38 |
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $1.74 |
| Output tokens | $3.48 |
Capabilities
About DeepSeek V4 Pro
DeepSeek V4 Pro is DeepSeek's flagship open-weights model, released April 24 2026 under the MIT license. Architecture: 1.6T total / 49B active parameters, MoE with Compressed Sparse Attention (CSA) + Heavily Compressed Attention (HCA) hybrid — requiring only 27% of inference FLOPs vs standard 1M-context transformers — plus Manifold-Constrained Hyper-Connections (mHC) and Muon Optimizer. Context window: 1,000,000 tokens; max output: 384,000 tokens (Think Max mode requires ≥384K context). Text-only (no vision/image input). Supports three reasoning modes: Non-Think, Think High, Think Max. Function calling, tool use, and structured outputs supported. Key benchmarks: SWE-bench Verified 80.6%, SWE-bench Pro 55.4%, LiveCodeBench 93.5%, GPQA Diamond 90.1%, MMLU-Pro 87.5%, Terminal-Bench 2.0 67.9%, Chatbot Arena 1460 (2026-04-28). Current API pricing: $0.435/$0.87 per 1M input/output tokens; DeepSeek made the former 75% promotional rate permanent effective 2026-05-31 15:59 UTC.
FAQ
What does DeepSeek V4 Pro cost on Fireworks AI?
On Fireworks AI, DeepSeek V4 Pro costs $1.74 per 1M input tokens and $3.48 per 1M output tokens.
What is the context window for DeepSeek V4 Pro on Fireworks AI?
DeepSeek V4 Pro supports a 1m token context window on Fireworks AI.
How does Fireworks AI compare to other DeepSeek V4 Pro providers?
DeepSeek V4 Pro is available from 5 providers. The cheapest input pricing is $0.435/1M tokens from DeepSeek Platform.
Who created DeepSeek V4 Pro?
DeepSeek V4 Pro was created by DeepSeek as part of the DeepSeek V4 model family.
Is DeepSeek V4 Pro open source?
DeepSeek V4 Pro is open source under MIT according to the seed data.