Using Arctic on Together AI

Implementation guide · Arctic · Snowflake

Serverless

Quick Start

  1. Create an account at Together AI and generate an API key.
  2. Use the Together AI SDK or REST API to call arctic; see the documentation for the request format.
  3. You'll be billed $2.40 per 1M input tokens and $2.40 per 1M output tokens. See full pricing.
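
The billing in step 3 can be turned into a quick per-request estimate. This is a minimal sketch: the prices come from the step above, while the function name and the token counts in the example call are illustrative only.

```python
# Prices copied from the billing note above (USD per 1M tokens).
INPUT_PRICE_PER_M = 2.40
OUTPUT_PRICE_PER_M = 2.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated charge in USD for one request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Illustrative request: 1,500 prompt tokens and 500 completion tokens.
print(f"${estimate_cost_usd(1_500, 500):.4f}")
```

Because input and output are priced the same here, the estimate depends only on the total token count.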

Code Examples

Install: pip install together
API key: TOGETHER_API_KEY
Model ID: arctic

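Together uses the "organization/model-name" format for model IDs, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". The bare ID "arctic" shown here does not follow that pattern, so look up the exact ID in the Together model catalog before calling the API.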

from together import Together

client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="arctic",  # replace with the exact ID from the Together model catalog
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)
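
The Quick Start also mentions the REST API. The sketch below builds the same request using only the standard library. The endpoint URL and the OpenAI-style response shape are assumptions not stated on this page, and `build_chat_request` is a hypothetical helper; check the Together documentation before relying on either.

```python
import json
import os
import urllib.request

# Assumed endpoint: Together's OpenAI-compatible chat completions route.
API_URL = "https://api.together.xyz/v1/chat/completions"


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a single-turn chat."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    key = os.environ.get("TOGETHER_API_KEY")
    if key:  # only touch the network when a key is configured
        req = build_chat_request(key, "arctic", "Hello")
        with urllib.request.urlopen(req) as resp:
            body = json.load(resp)
        # Assumed response shape, mirroring the SDK example above.
        print(body["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes it easy to inspect the payload and headers without spending tokens.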

About Together AI

Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Pricing on Together AI

Type            Price (per 1M tokens)
Input tokens    $2.40
Output tokens   $2.40

Capabilities

Structured Outputs
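
This page lists structured outputs as a capability but does not show how to request them. The sketch below is one plausible shape, not a confirmed recipe: the `response_format` field and its keys are assumptions to verify against the Together documentation, and the schema and both helper functions are hypothetical. The parsing step is plain JSON handling and works regardless of how the reply was requested.

```python
import json


def structured_request_kwargs(model: str, prompt: str, schema: dict) -> dict:
    """Keyword arguments for client.chat.completions.create (assumed shape)."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        # Assumption: JSON output is requested via `response_format`;
        # verify the exact field names in the Together documentation.
        "response_format": {"type": "json_object", "schema": schema},
    }


def parse_structured_reply(text: str, required_keys: list[str]) -> dict:
    """Parse the model's reply as JSON and check that the expected keys exist."""
    data = json.loads(text)
    missing = [k for k in required_keys if k not in data]
    if missing:
        raise ValueError(f"reply is missing keys: {missing}")
    return data


# Hypothetical schema, for illustration only.
schema = {
    "type": "object",
    "properties": {"sql": {"type": "string"}},
    "required": ["sql"],
}
kwargs = structured_request_kwargs("arctic", "Write a query that counts users.", schema)
# response = client.chat.completions.create(**kwargs)
# result = parse_structured_reply(response.choices[0].message.content, ["sql"])
```

Validating the reply on the client side is worthwhile even when the provider enforces a schema, since a truncated response can still yield invalid JSON.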

About Arctic

Snowflake Arctic is a large language model built for enterprise applications by Snowflake AI Research. It uses a Dense-MoE Hybrid transformer architecture that combines a 10 billion parameter dense transformer with a 128 × 3.66 billion parameter MoE MLP, for roughly 480 billion total parameters, of which only about 17 billion are active per token. This design targets efficiency on tasks such as SQL generation, coding, and instruction following. The model was trained on 3.5 trillion tokens drawn from a diverse dataset focused on enterprise needs. Its size makes deployment challenging, and it can produce inaccurate output when inputs are unclear. It is open-sourced under the Apache 2.0 license, with weights, code, and research findings available [1][2][4][5].
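
The parameter figures above can be checked with simple arithmetic. The sketch below uses the numbers from the paragraph; the only addition is the assumption that two experts are active per token (top-2 routing), which the text does not state but which is consistent with the 17 billion active figure.

```python
DENSE_B = 10.0       # dense transformer, billions of parameters
EXPERTS = 128        # number of MoE experts
EXPERT_B = 3.66      # parameters per expert, billions
ACTIVE_EXPERTS = 2   # assumption: top-2 routing, not stated in the text

total_b = DENSE_B + EXPERTS * EXPERT_B          # all experts counted
active_b = DENSE_B + ACTIVE_EXPERTS * EXPERT_B  # only the routed experts

print(f"total ~ {total_b:.1f}B, active ~ {active_b:.1f}B")
```

The total comes to about 478.5B, which rounds to the quoted 480B, and the active count comes to about 17.3B.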

Model Specs

Released: 2024-04-24
Parameters: 480B
Context: 4K
Architecture: Mixture of Experts
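
A 4K context leaves limited room for the reply once the prompt is counted. The helper below is a rough sketch under two assumptions: that "4K" means 4,096 tokens and that prompt and completion share the same window. The function name and the `reserve` parameter are hypothetical.

```python
CONTEXT_TOKENS = 4096  # assumption: "4K" means 4,096 tokens


def max_completion_tokens(prompt_tokens: int, reserve: int = 0) -> int:
    """Tokens left for the reply after the prompt and an optional safety reserve."""
    remaining = CONTEXT_TOKENS - prompt_tokens - reserve
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return remaining


# Illustrative call: a 3,000-token prompt with a 96-token safety margin.
print(max_completion_tokens(3000, reserve=96))
```

The returned value is a natural upper bound for a `max_tokens` setting on the request.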

Provider

Together AI

San Francisco, California, United States