LLM Reference

Starling Alpha Models by Berkeley Artificial Intelligence Research (BAIR)

1 model2024Up to 8k ctx

About

The Starling Alpha family of large language models (LLMs), developed by the Berkeley NEST team, includes models like Starling-LM-7B-alpha and Starling-LM-7B-beta. These models are fine-tuned versions of OpenChat 3.5, employing Reinforcement Learning from AI Feedback (RLAIF) 15. Leveraging the Nectar dataset and advanced reward training and policy tuning pipelines, the models excel in conversational AI, content generation, and question answering, achieving high scores on the MT Bench benchmark, with the beta version scoring 8.12 2. Available on Hugging Face and other platforms, these open-source models have restricted licenses for commercial use and competition with OpenAI 5.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view

Use when the workload needs 8k context and 7B parameters.

2024-028k context7B parameters

Release Timeline

1 release group
2024-02
1 current
Starling LM 7B Alpha
8k context7B parameters
Current

Specifications(1 models)

Starling Alpha model specifications comparison
ModelReleasedContextParameters
Starling LM 7B Alpha2024-028k7B

Frequently Asked Questions

What is Starling Alpha used for?
Starling Alpha is used for math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does Starling Alpha compare to MOSS-Audio?
Starling Alpha by Berkeley Artificial Intelligence Research (BAIR) is strongest where you need math-heavy prompts, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Starling Alpha has 1 listed variant and reaches up to 8k context, so compare the specs and pricing tables before choosing a production model.
Which Starling Alpha model should I use?
If price is the main constraint, use the pricing table first because Starling Alpha does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Starling LM 7B Alpha with 8k context.

Models(1)