LLM Reference

ALLaM Models by Saudi Data and Artificial Intelligence Authority

1 model2023Up to 4k ctxFrom $1.8/1M input

About

The ALLaM family, developed by the Saudi Data and Artificial Intelligence Authority (SDAIA), comprises large language models (LLMs) tailored for Arabic Language Technologies (ALT). Designed to be proficient in both Arabic and English, these models employ an autoregressive decoder-only architecture and are pretrained on a blend of Arabic and English texts. A critical focus of their development is on language alignment and knowledge transfer, striving for state-of-the-art performance in Arabic benchmarks. SDAIA has introduced several models within this family, including 7B, 13B, and 70B parameter models, some of which are built from scratch, while others extend training from models like Llama-2. These models are accessible via IBM's Watsonx platform under a royalty-free license, supporting both commercial and governmental applications. Significant data collection and curation efforts have resulted in one of the largest global Arabic datasets.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view
ALLaM 13BCurrent

Use when the workload needs 4k context and 13B parameters.

2023-094k context13B parameters

Release Timeline

1 release group
2023-09
1 current
ALLaM 13B
4k context13B parameters
Current

Specifications(1 models)

ALLaM model specifications comparison
ModelReleasedContextParameters
ALLaM 13B2023-094k13B

Available From(1 provider)

Pricing

ALLaM model pricing by provider
ModelProviderInput / 1MOutput / 1MType
ALLaM 13BIBM watsonx$1.8$1.8Serverless

Frequently Asked Questions

What is ALLaM used for?
ALLaM is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
How does ALLaM compare to Claude 3?
ALLaM by Saudi Data and Artificial Intelligence Authority is strongest where you need coding, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. ALLaM has 1 listed variant and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which ALLaM model should I use?
For the lowest listed input price, start with ALLaM 13B through IBM watsonx at $1.8/1M input tokens. For the most capable/latest local choice, evaluate ALLaM 13B with 4k context.

Models(1)