How much does Zephyr ORPO 141B cost?

Zephyr ORPO 141B is available at $0.65/1M input tokens through DeepInfra.

When was Zephyr ORPO 141B released?

Zephyr ORPO 141B was released on 2023-10-26.

Which providers offer Zephyr ORPO 141B?

Zephyr ORPO 141B is available from 1 provider: DeepInfra.

Zephyr ORPO 141B

Name: Zephyr ORPO 141B
Author: Hugging Face H4

Released

2023-10-26

Last refreshed

2026-04-24

Status

Researched 46d ago

ClassificationJSON / Tool use

Zephyr ORPO 141B is worth evaluating for classification and json / tool use when its provider route and context window match the workload.

Use it for

Teams evaluating classification and json / tool use
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Zephyr
Released: 2023-10-26
Parameters: 141B
Architecture: Decoder Only
Knowledge cutoff: 2024-01
Specialization: general
Training: finetuned

Created by

Hugging Face H4

Community-driven open-source AI model hub

New York City, New York, United States

Founded 2016

Website

Pricing

Output / 1M

$0.650

Input / 1M

$0.650

Cheapest of 1 route · DeepInfra

Providers(1)

DeepInfra

View 1 provider route

About

The Zephyr ORPO 141B is a cutting-edge large language model by Hugging Face, developed in partnership with Argilla and KAIST. It employs a Mixture of Experts (MoE) architecture, consisting of 141 billion parameters, with 39 billion active during operation. The model is derived from the Mixtral-8x22B framework and fine-tuned using the innovative Odds Ratio Preference Optimization (ORPO) method, which improves computational efficiency by removing the need for a separate supervised fine-tuning phase. Zephyr ORPO demonstrates impressive proficiency in tasks such as open-ended conversations, question answering, and coding assistance, evidenced by high scores on benchmarks like MT Bench (8.17) and IFEval (65.06) 139. The model, trained on a dataset of 7,000 instances, is optimized for multi-turn dialogues, making it ideal for interactive AI applications. However, attention should be paid to its lack of alignment with human safety preferences, as this could lead to problematic outputs 4512.

Zephyr ORPO 141B is a model in the Zephyr family. The structured metadata tracks structured outputs. This page tracks provider routes through DeepInfra, with the cheapest tracked route listed at $0.65 input and $0.65 output per 1M tokens. No headline benchmark score is tracked for Zephyr ORPO 141B yet.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
DeepInfra	$0.650	$0.650	Serverless

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(5)

Best LLMs for ClassificationListed Cheapest LLM APIs You Can Call Right NowListed Best Mainstream LLM APIs, RankedListed Best LLMs for WritingListed Best LLMs for MarketingListed