llmreference
Together AI

Using DeepSeek 67B Chat on Together AI

Implementation guide · DeepSeek · DeepSeek

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Together AI and generate an API key.
  2. 2
    Use the Together AI SDK or REST API to call deepseek-67b-chat — see the documentation for request format.
  3. 3
    You'll be billed $0.90/1M input, $0.90/1M output tokens. See full pricing.

Code Examples

Install
pip install together
API key
TOGETHER_API_KEY
Model ID
deepseek-67b-chat

Together uses "organization/model-name" format, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". See the Together model catalog for the exact ID.

from together import Together

client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="deepseek-67b-chat",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

About Together AI

Platform for running open-source and proprietary LLMs

Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Pricing on Together AI

TypePrice (per 1M)
Input tokens$0.90
Output tokens$0.90

Capabilities

Structured Outputs

About DeepSeek 67B Chat

DeepSeek LLM 67B Chat is a sophisticated language model with 67 billion parameters, leveraging the LLaMA architecture with enhancements such as Grouped-Query Attention across 95 layers. Trained on a vast corpus of 2 trillion tokens in English and Chinese, it excels in tasks like text generation, question answering, and fluent conversation, demonstrating superior performance in reasoning, coding, and mathematics compared to some larger models. Despite its advanced capabilities, the model can exhibit biases from its training data, experience hallucinations, and produce repetitive outputs. Due to its size, substantial computational resources are needed for inference, although quantization methods can reduce its size with potential trade-offs in quality.

Model Specs

Released2023-11-29
Parameters67B
Context4K
ArchitectureDecoder Only
Knowledge cutoff2023-05

Provider

Together AI
Together AI

San Francisco, California, United States