Starling LM 7B Beta
Starling LM 7B Beta has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating coding and classification
- Workloads that can use a 8k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Starling
- Released
- 2024-02-05
- Context
- 8k
- Parameters
- 7B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
About
Starling LM 7B Beta is an open-source large language model crafted by Nexusflow, leveraging a 7-billion parameter transformer architecture tailored for conversational AI. Fine-tuned with Reinforcement Learning from AI Feedback (RLAIF), it aims to enhance helpfulness and minimize harm. Built on the foundation of the Openchat-3.5-0106 and Mistral-7B-v0.1 models, it utilizes the berkeley-nest/Nectar ranking dataset, Nexusflow/Starling-RM-34B reward model, and Proximal Policy Optimization (PPO) strategy. Achieving an improved MT Bench score of 8.12, its capabilities span engaging conversations, informative responses, and tasks like content and code generation. While it shows strong performance among 7B models, verbose outputs and strict adherence to a provided chat template are notable considerations. Licensed under Apache-2.0 with restrictions against competing with OpenAI, it continues to offer robust functionality within its calibrated framework.
Starling LM 7B Beta is a model in the Starling family. The structured metadata tracks a 8k-token context window. Headline tracked benchmarks include Google-Proof Q&A 49.7, HellaSwag 89.2, and HumanEval 74.2.
Top use-case fit: coding, agents, and build tasks
Coding
1 relevant benchmark in the decision map.
Classification
2 relevant benchmarks in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
Benchmark scores(4)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 49.7 | diamond | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| HellaSwag | 89.2 | 10-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| HumanEval | 74.2 | pass@1 | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| Massive Multitask Language Understanding | 77.8 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
Migration checks
No linked migration route is available for this model yet.