
Starling Alpha
About
The Starling Alpha family of large language models (LLMs), developed by the Berkeley NEST team, includes models like Starling-LM-7B-alpha and Starling-LM-7B-beta. These models are fine-tuned versions of OpenChat 3.5, employing Reinforcement Learning from AI Feedback (RLAIF) 15. Leveraging the Nectar dataset and advanced reward training and policy tuning pipelines, the models excel in conversational AI, content generation, and question answering, achieving high scores on the MT Bench benchmark, with the beta version scoring 8.12 2. Available on Hugging Face and other platforms, these open-source models have restricted licenses for commercial use and competition with OpenAI 5.