Zephyr ORPO 141B
Zephyr ORPO 141B is worth evaluating for classification and json / tool use when its provider route and context window match the workload.
Use it for
- Teams evaluating classification and json / tool use
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Family
- Zephyr
- Released
- 2023-10-26
- Parameters
- 141B
- Architecture
- Decoder Only
- Knowledge cutoff
- 2024-01
- Specialization
- general
- Training
- finetuned
Cheapest of 1 route · DeepInfra
About
The Zephyr ORPO 141B is a cutting-edge large language model by Hugging Face, developed in partnership with Argilla and KAIST. It employs a Mixture of Experts (MoE) architecture, consisting of 141 billion parameters, with 39 billion active during operation. The model is derived from the Mixtral-8x22B framework and fine-tuned using the innovative Odds Ratio Preference Optimization (ORPO) method, which improves computational efficiency by removing the need for a separate supervised fine-tuning phase. Zephyr ORPO demonstrates impressive proficiency in tasks such as open-ended conversations, question answering, and coding assistance, evidenced by high scores on benchmarks like MT Bench (8.17) and IFEval (65.06) 139. The model, trained on a dataset of 7,000 instances, is optimized for multi-turn dialogues, making it ideal for interactive AI applications. However, attention should be paid to its lack of alignment with human safety preferences, as this could lead to problematic outputs 4512.
Zephyr ORPO 141B is a model in the Zephyr family. The structured metadata tracks structured outputs. This page tracks provider routes through DeepInfra, with the cheapest tracked route listed at $0.65 input and $0.65 output per 1M tokens. No headline benchmark score is tracked for Zephyr ORPO 141B yet.
Top use-case fit
Classification
Included by capability and metadata signals in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| DeepInfra | $0.650 | $0.650 | Serverless |
Capabilities
Benchmark peer barsfor Classification
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.