LLM Reference

Arctic Models by Snowflake

Snowflake · Apache 2.0 · Open source
2 models · 2024 · Up to 4k ctx · From $0.65/1M input

Details

Researcher: Snowflake
License: Apache 2.0 (OSI)
Commercial use: Allowed
Models: 2
Released: 2024
Max context: 4k

Capabilities

Structured Outputs: 1 of 2 models

About

Snowflake's Arctic family is a set of large language models built for enterprise applications, with an emphasis on both quality and resource efficiency. The flagship Arctic model uses a Mixture-of-Experts (MoE) hybrid transformer architecture: a dense 10B-parameter transformer combined with a 128x3.66B MoE MLP, for a total of roughly 480B parameters, of which only about 17B are active for any given token. This design lets Arctic deliver high-quality results while limiting the compute spent per token, outperforming many other prominent open models on various benchmarks. The lineup also includes Arctic-Instruct, tuned for following instructions and answering natural-language queries, and Arctic Embed, a set of text embedding models built for retrieval tasks. All models are available under the Apache 2.0 license, which permits open use and collaboration [1][2].
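The parameter counts above can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in the description (10B dense, 128 experts of 3.66B each). The number of experts routed per token is not stated on this page, so `TOP_K = 2` is an assumption chosen because it reproduces the quoted 17B active figure; the function names are illustrative.

```python
# Parameter arithmetic for a dense + MoE hybrid, using the figures quoted above.
DENSE_B = 10.0      # dense transformer parameters, in billions
NUM_EXPERTS = 128   # number of MoE MLP experts
EXPERT_B = 3.66     # parameters per expert, in billions
TOP_K = 2           # ASSUMPTION: experts selected per token (not stated on this page)


def total_params_b() -> float:
    """Total parameters in billions: the dense part plus every expert."""
    return DENSE_B + NUM_EXPERTS * EXPERT_B


def active_params_b(top_k: int = TOP_K) -> float:
    """Parameters used per token in billions: the dense part plus the routed experts."""
    return DENSE_B + top_k * EXPERT_B


if __name__ == "__main__":
    print(f"total:  {total_params_b():.2f}B")
    print(f"active: {active_params_b():.2f}B")
```

Under these assumptions the total comes to about 478B (quoted as 480B) and the active count to about 17.3B (quoted as 17B), so only a small fraction of the weights is used for each token.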

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view · 1 retired

Arctic-TILT: use when the workload calls for an 800M-parameter model.

2024-05 · 800M parameters

Release Timeline

2 release groups

2024-05 (1 current)
Arctic-TILT · 800M parameters · Current

2024-04 (1 retired)
Arctic · 4k context · 480B parameters · structured outputs · Archived

Specifications (2 models)

Arctic model specifications comparison

Model | Released | Context | Parameters | Structured Outputs
Arctic-TILT | 2024-05 | - | 800M | No

Pricing

Frequently Asked Questions

What is Arctic used for?
Arctic is used for structured outputs and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Arctic compare to Claude 3?
Arctic by Snowflake is strongest where you need structured outputs, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Arctic has 2 listed variants and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Arctic model should I use?
For the lowest listed input price, start with Arctic through Replicate API at $0.65/1M input tokens. For the most capable/latest local choice, evaluate Arctic-TILT.
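To make the listed price and context limit concrete, here is a minimal sketch that estimates input cost and checks whether a request fits the window. It uses the $0.65 per 1M input tokens figure from above. Two things are assumptions: "4k" is taken to mean 4,096 tokens, and the context window is treated as shared between prompt and generated output. No output price is listed on this page, so only input cost is estimated. The helper names are hypothetical and not part of any provider's API.

```python
# Rough budgeting helpers based on the figures listed on this page.
PRICE_PER_MILLION_INPUT_USD = 0.65  # listed input price for Arctic via Replicate
CONTEXT_TOKENS = 4096               # ASSUMPTION: "4k" context means 4,096 tokens


def input_cost_usd(input_tokens: int,
                   price_per_million: float = PRICE_PER_MILLION_INPUT_USD) -> float:
    """Estimate the input-side cost in USD for a given number of input tokens."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return input_tokens / 1_000_000 * price_per_million


def fits_context(prompt_tokens: int, max_output_tokens: int = 0,
                 context: int = CONTEXT_TOKENS) -> bool:
    """Check that prompt plus requested output stays within the context window.

    ASSUMPTION: prompt and output tokens share one window.
    """
    return prompt_tokens + max_output_tokens <= context


if __name__ == "__main__":
    # A full 4,096-token prompt costs well under one cent on the input side.
    print(f"{input_cost_usd(4096):.7f} USD")
    print(fits_context(4000, max_output_tokens=96))
```

At this price a full-window prompt costs a fraction of a cent, so for most workloads the 4k context limit is likely to be the tighter constraint than input cost.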

Models (2)