ALLaM Models by Saudi Data and Artificial Intelligence Authority
About
The ALLaM family, developed by the Saudi Data and Artificial Intelligence Authority (SDAIA), comprises large language models (LLMs) tailored for Arabic Language Technologies (ALT). Designed to be proficient in both Arabic and English, these models employ an autoregressive decoder-only architecture and are pretrained on a blend of Arabic and English texts. A critical focus of their development is on language alignment and knowledge transfer, striving for state-of-the-art performance in Arabic benchmarks. SDAIA has introduced several models within this family, including 7B, 13B, and 70B parameter models, some of which are built from scratch, while others extend training from models like Llama-2. These models are accessible via IBM's Watsonx platform under a royalty-free license, supporting both commercial and governmental applications. Significant data collection and curation efforts have resulted in one of the largest global Arabic datasets.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 4k context and 13B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| ALLaM 13B | Use when the workload needs 4k context and 13B parameters. | 2023-09 | 4k context13B parameters | Current |
Release Timeline
1 release groupSpecifications(1 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| ALLaM 13B | 2023-09 | 4k | 13B |
Available From(1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| ALLaM 13B | IBM watsonx | $1.8 | $1.8 | Serverless |
Frequently Asked Questions
- What is ALLaM used for?
- ALLaM is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
- How does ALLaM compare to Claude 3?
- ALLaM by Saudi Data and Artificial Intelligence Authority is strongest where you need coding, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. ALLaM has 1 listed variant and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
