What is Arctic used for?

Arctic is used for structured outputs and coding. The family description and listed model capabilities point to those workloads as the best fit.

How does Arctic compare to Claude 3?

Arctic by Snowflake is strongest where you need structured outputs, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Arctic has 2 listed variants and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.

Which Arctic model should I use?

For the lowest listed input price, start with Arctic through Replicate API at $0.65/1M input tokens. For the most capable/latest local choice, evaluate Arctic-TILT.

Arctic Models by Snowflake

SnowflakeApache 2.0Open source

2 models2024Up to 4k ctxFrom $0.65/1M input

Details

ResearcherSnowflake

LicenseApache 2.0OSI-approved

Commercial useCommercial use: permitted

Models2

Released2024

Max context4k

Capabilities

Structured Outputs1 of 2 models

Links

Website HuggingFace

About

Snowflake's Arctic family of LLMs is a powerful suite of large language models engineered for enterprise applications, emphasizing both intelligence and resource efficiency. Leading the collection is the Arctic model, which features a unique Mixture-of-Experts (MoE) hybrid transformer architecture. This combines a dense 10B parameter transformer with a 128x3.66B MoE MLP, resulting in an impressive total of 480B parameters, though only 17B are active at any given time. This innovative design allows Arctic to deliver high-quality outcomes while optimizing resource use, outperforming many other prominent open models in various benchmarks. The Arctic lineup also includes models like Arctic-Instruct, tailored for following instructions and generating high-quality responses from natural language queries, and Arctic Embed, a set of text embedding models engineered for retrieval tasks. All models are available under the Apache 2.0 license, fostering open usage and collaboration 12.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view1 retired

Arctic-TILTCurrent

Use when the workload needs 800M parameters.

2024-05800M parameters

Current Arctic variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Arctic-TILT	Use when the workload needs 800M parameters.	2024-05	800M parameters	Current

Release Timeline

2 release groups

2024-05

1 current

Arctic-TILT

800M parameters

Current

2024-04

1 retired

Arctic

4k context480B parametersstructured outputs

Archived

Specifications(2 models)

Arctic model specifications comparison
Model	Released	Context	Parameters	Structured Outputs
Arctic-TILT	2024-05	—	800M	No

Available From(4 providers)

Pricing

Frequently Asked Questions

What is Arctic used for?: Arctic is used for structured outputs and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Arctic compare to Claude 3?: Arctic by Snowflake is strongest where you need structured outputs, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Arctic has 2 listed variants and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Arctic model should I use?: For the lowest listed input price, start with Arctic through Replicate API at $0.65/1M input tokens. For the most capable/latest local choice, evaluate Arctic-TILT.

Models(2)

Arctic-TILT

2024-05800M

Open Source