LLM Reference

Zephyr ORPO 141B

Released: 2023-10-26
Last refreshed: 2026-04-24
Status: Researched 46d ago
Classification · JSON / Tool use

Zephyr ORPO 141B is worth evaluating for classification and JSON / tool-use workloads when its provider route and context window fit the workload.

Use it for

  • Teams evaluating classification and JSON / tool use
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads

Specifications

Family: Zephyr
Released: 2023-10-26
Parameters: 141B
Architecture: Decoder Only
Knowledge cutoff: 2024-01
Specialization: general
Training: finetuned
Created by

Hugging Face
Community-driven open-source AI model hub
New York City, New York, United States
Founded 2016

Pricing

Input / 1M: $0.650
Output / 1M: $0.650

Cheapest of 1 route · DeepInfra

About

Zephyr ORPO 141B is a large language model from Hugging Face, developed in partnership with Argilla and KAIST. It uses a Mixture of Experts (MoE) architecture with 141 billion total parameters, of which 39 billion are active per token. The model is fine-tuned from the Mixtral-8x22B base model using Odds Ratio Preference Optimization (ORPO), which folds preference alignment into the fine-tuning objective and so removes the need for a separate supervised fine-tuning phase. It targets open-ended conversation, question answering, and coding assistance, and reports scores of 8.17 on MT Bench and 65.06 on IFEval. It was trained on a dataset of 7,000 instances and is optimized for multi-turn dialogue, which suits interactive applications. The model has not been aligned with human safety preferences, so it can produce problematic outputs.
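To make the ORPO idea concrete, here is a minimal sketch of the per-example objective as described in the ORPO paper: a standard negative log-likelihood term on the preferred response plus a weighted odds-ratio penalty against the rejected one. The function names, the use of length-normalized log-probabilities as inputs, and the weight `lam=0.1` are illustrative assumptions, not the training configuration used for this model.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is assumed to be a length-normalized sequence
    log-probability, so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(avg_logp_chosen: float, avg_logp_rejected: float, lam: float = 0.1) -> float:
    """Illustrative ORPO loss for one (chosen, rejected) pair.

    lam is a hypothetical weighting chosen for this sketch.
    """
    # Supervised term: negative log-likelihood of the preferred response.
    nll = -avg_logp_chosen
    # Log of the odds ratio between preferred and rejected responses.
    log_odds_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(z)) written as log(1 + exp(-z)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll + lam * odds_ratio_term


# With the preferred response held fixed, a more likely rejected
# response produces a larger loss.
low = orpo_loss(-0.5, -2.0)
high = orpo_loss(-0.5, -0.1)
```

Because both terms are computed from the same forward passes, a single training stage covers what would otherwise be split between supervised fine-tuning and a later preference-optimization step.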

Zephyr ORPO 141B belongs to the Zephyr family. Its structured metadata lists structured outputs as a capability. This page tracks one provider route, DeepInfra, priced at $0.65 per 1M input tokens and $0.65 per 1M output tokens. This page does not yet track a headline benchmark score for Zephyr ORPO 141B.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 provider for input and output tokens, plus batch and cached reads when available.

Provider    Input / 1M    Output / 1M    Route
DeepInfra   $0.650        $0.650         Serverless
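
As a rough budgeting aid, the listed per-1M-token prices can be turned into a workload estimate. The sketch below assumes the $0.65 input and $0.65 output prices shown on this page; the function name and the example token counts are hypothetical, and real bills may differ if the provider changes pricing or rounds differently.

```python
from decimal import Decimal

# Prices per 1M tokens as listed on this page for the DeepInfra route.
INPUT_USD_PER_MILLION = Decimal("0.65")
OUTPUT_USD_PER_MILLION = Decimal("0.65")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in USD for a given number of input and output tokens."""
    million = Decimal(1_000_000)
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / million


# Hypothetical workload: 10,000 classification requests,
# each with about 800 input tokens and 50 output tokens.
cost = estimate_cost_usd(10_000 * 800, 10_000 * 50)
```

`Decimal` is used instead of floats so that small per-token prices do not accumulate binary rounding error across large token counts.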

Capabilities

Structured Outputs
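
When relying on structured outputs for classification, it is usually worth validating the reply before acting on it. The following is a minimal sketch using only the Python standard library; the label set, the `label` field name, and the function name are hypothetical choices for illustration and are not part of any provider's API.

```python
import json

# Hypothetical label set for an example ticket-routing classifier.
ALLOWED_LABELS = {"billing", "technical", "other"}


def parse_classification(reply_text: str) -> str:
    """Parse a model reply expected to be a JSON object like {"label": "billing"}.

    Raises ValueError if the reply is not valid JSON, is not an object,
    or carries a label outside ALLOWED_LABELS.
    """
    try:
        payload = json.loads(reply_text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    if not isinstance(payload, dict):
        raise ValueError("expected a JSON object")
    label = payload.get("label")
    if not isinstance(label, str) or label not in ALLOWED_LABELS:
        raise ValueError(f"unexpected label: {label!r}")
    return label
```

A caller might retry the request or fall back to a default label when `ValueError` is raised, rather than passing an unchecked reply downstream.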

Benchmark peer bars for Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks (5)