LLM Reference

OpenHermes 2.5 Mistral 7B

Released
2023-12-15
Last refreshed
2026-05-11
Status
Researched 46d ago
CodingClassification

OpenHermes 2.5 Mistral 7B is worth evaluating for coding and classification when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding and classification
  • Workloads that can use a 32k context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Released
2023-12-15
Context
32k
Parameters
7B
Architecture
Decoder Only
Knowledge cutoff
2023-12
Specialization
general
Training
finetuned
Created by

Steerable AI models for open-source innovation

N/A
Founded N/A
Website
Pricing
Output / 1M
$0.200
Input / 1M
$0.200

Cheapest of 2 routes · Fireworks AI

About

OpenHermes 2.5 Mistral 7B is an advanced large language model developed by Teknium, building on the previous version, OpenHermes 2. Utilizing a transformer architecture, it's fine-tuned on over one million entries, combining code and non-code data, primarily composed of GPT-4 generated text. This enhances its human-like response capabilities across diverse contexts. It excels in conversational AI with its multi-turn dialogue support through the ChatML format, significantly improves in code generation tasks with a high HumanEval score, and performs robustly on benchmarks like GPT4All and AGIEval. Additionally, it offers various quantization options for optimized hardware performance and is adept at real-time data processing, making it ideal for customer service and immediate data analysis applications.

OpenHermes 2.5 Mistral 7B is a model in the OpenHermes 2 family. The structured metadata tracks a 32k-token context window. This page tracks provider routes through Together AI and Fireworks AI, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 38.6, HellaSwag 86.0, and HumanEval 54.8.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ B

1 relevant benchmark in the decision map.

Classification

Q/$ B

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.200$0.200
Provisioned
Together AI$0.200$0.200
Serverless

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

Benchmark scores(4)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Google-Proof Q&A38.6diamondhttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
HellaSwag86.010-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
HumanEval54.8pass@1https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
Massive Multitask Language Understanding63.55-shothttps://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(7)