LLM Reference

Zephyr Models by Hugging Face H4

Hugging Face H4Open Source
5 models2023Up to 8k ctxFrom $0.05/1M input

About

The Zephyr family comprises advanced large language models specifically designed to function as highly responsive digital assistants. Notable for their human-like conversational abilities, these models excel in applications involving chatbots and virtual assistant roles. Developed with cutting-edge techniques such as distilled supervised fine-tuning (dSFT), AI feedback (AIF), and distilled direct preference optimization (dDPO), Zephyr models ensure that their output aligns closely with user intent. They often outperform larger models on certain benchmarks, despite being more compact 24. However, in areas requiring complex logic or specialized knowledge, they may face limitations 4. The Zephyr lineup includes iterations like Zephyr-7B-alpha and Zephyr-7B-beta, which is a fine-tuned variant of Mistral-7B 4513. Available through Hugging Face, these models are versatile tools for a range of natural language processing tasks 24.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

5 in view

Use when the workload needs 7B parameters.

2023-107B parameters

Use when the workload needs 7B parameters.

2023-107B parameters

Use when the workload needs 7B parameters.

2023-107B parameters

Use when the workload needs 141B parameters and structured outputs.

2023-10141B parametersstructured outputs

Use when the workload needs 8k context and 7B parameters.

2023-108k context7B parameters

Release Timeline

1 release group
2023-10
5 current
Fireworks Zephyr-7B-beta
8k context7B parameters
Current
Zephyr 7B Alpha
7B parameters
Current
Zephyr 7B Beta
7B parameters
Current
Zephyr 7B Gemma
7B parameters
Current
Zephyr ORPO 141B
141B parametersstructured outputs
Current

Specifications(5 models)

Zephyr model specifications comparison
ModelReleasedContextParametersStructured Outputs
Zephyr 7B Alpha2023-107BNo
Zephyr 7B Beta2023-107BNo
Zephyr 7B Gemma2023-107BNo
Zephyr ORPO 141B2023-10141BYes
Fireworks Zephyr-7B-beta2023-108k7BNo

Available From(4 providers)

Pricing

Zephyr model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Zephyr 7B BetaReplicate API$0.05$0.25Serverless
Zephyr 7B AlphaReplicate API$0.05$0.25Serverless
Fireworks Zephyr-7B-betaFireworks AI$0.1$0.1Serverless
Zephyr 7B BetaFireworks AI$0.2$0.2Provisioned
Zephyr ORPO 141BDeepInfra$0.65$0.65Serverless

Frequently Asked Questions

What is Zephyr used for?
Zephyr is used for structured outputs, coding, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
How does Zephyr compare to MOSS-Audio?
Zephyr by Hugging Face H4 is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Zephyr has 5 listed variants and reaches up to 8k context, so compare the specs and pricing tables before choosing a production model.
Which Zephyr model should I use?
For the lowest listed input price, start with Zephyr 7B Beta through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Zephyr ORPO 141B with structured outputs.

Models(5)