LLM ReferenceLLM Reference
NVIDIA NIM

DeepSeek V3.1 Terminus on NVIDIA NIM

DeepSeek · DeepSeek

ServerlessOpen Source

Last refreshed 2026-05-01. Next refresh: weekly.

Why use DeepSeek V3.1 Terminus on NVIDIA NIM?

NVIDIA NIM offers DeepSeek V3.1 Terminus with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Compare DeepSeek V3.1 Terminus across 2 providers to find the best fit for your use case
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: deepseek-ai/deepseek-v3_1-terminus
Model ID
deepseek-ai/deepseek-v3_1-terminus

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID deepseek-ai/deepseek-v3_1-terminus.

Gotchas

  • Use provider model ID "deepseek-ai/deepseek-v3_1-terminus", not the LLMReference slug "deepseek-v3.1-terminus".

Compare DeepSeek V3.1 Terminus Across Providers

ProviderInput (per 1M)Output (per 1M)
NVIDIA NIM
OpenRouter$0.21$0.79

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr

Capabilities

Structured Outputs

About DeepSeek V3.1 Terminus

DeepSeek: DeepSeek V3.1 Terminus available via OpenRouter. Pricing: $0.21/1M input, $0.79/1M output.

FAQ

What is the context window for DeepSeek V3.1 Terminus on NVIDIA NIM?

DeepSeek V3.1 Terminus supports a 164,000 token context window on NVIDIA NIM.

How does NVIDIA NIM compare to other DeepSeek V3.1 Terminus providers?

DeepSeek V3.1 Terminus is available from 2 providers. The cheapest input pricing is $0.21/1M tokens from OpenRouter.

What API model ID do I use for DeepSeek V3.1 Terminus on NVIDIA NIM?

Use the model ID deepseek-ai/deepseek-v3_1-terminus when calling NVIDIA NIM's API.

Who created DeepSeek V3.1 Terminus?

DeepSeek V3.1 Terminus was created by DeepSeek as part of the DeepSeek model family.

Is DeepSeek V3.1 Terminus open source?

DeepSeek V3.1 Terminus is open source according to the seed data.

Get Started

Model Specs

Released2025-04-01
Context164K
ArchitectureDecoder Only