Zephyr 7B Beta
Zephyr 7B Beta is worth evaluating for coding and classification when its provider route and context window match the workload.
Use it for
- Teams evaluating coding and classification
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Zephyr
- Released
- 2023-10-26
- Parameters
- 7B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
Cheapest of 2 routes · Fireworks AI
About
Zephyr 7B Beta is a 7-billion parameter large language model, fine-tuned from the Mistral-7B-v0.1 model. It is tailored to serve as an effective virtual assistant, performing well in generating human-like responses. The model's training involved Direct Preference Optimization (DPO) on a combination of publicly available and synthetic datasets, achieving strong performance on benchmarks like MT-Bench and AlpacaEval, especially for conversational tasks. However, its complexity falls short when compared to proprietary models, especially in tasks involving coding and mathematics. A notable limitation is its insufficient alignment to human safety preferences and the absence of in-the-loop filtering to prevent problematic outputs. Zephyr 7B Beta is English-based and carries an MIT license.
Zephyr 7B Beta is a model in the Zephyr family. This page tracks provider routes through Fireworks AI and Replicate API, with the cheapest tracked route listed at $0.05 input and $0.25 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 47.3, HellaSwag 88.1, and HumanEval 67.8.
Top use-case fit: coding, agents, and build tasks
Coding
Q/$ A1 relevant benchmark in the decision map.
Classification
Q/$ B2 relevant benchmarks in the decision map.
Provider price ladder
Compare all 2Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.200 | $0.200 | Provisioned |
| Replicate API | $0.050 | $0.250 | Serverless |
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
Benchmark scores(4)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Google-Proof Q&A | 47.3 | diamond | research |
| HellaSwag | 88.1 | 10-shot | research |
| HumanEval | 67.8 | pass@1 | research |
| Massive Multitask Language Understanding | 71.4 | 5-shot | research |
Migration checks
No linked migration route is available for this model yet.