Hermes 2 Pro Llama 3 8B
Hermes 2 Pro Llama 3 8B is worth evaluating for general LLM work when its provider route and context window match the workload.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 8k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- Hermes 2
- Released
- 2023-12-12
- Context
- 8k
- Parameters
- 8B
- Architecture
- Decoder Only
- Knowledge cutoff
- 2023-12
- Specialization
- general
- Training
- finetuned
Cheapest of 4 routes · Novita AI
About
8B Hermes model merging Hermes 2 Pro with Llama 3 architecture for superior function calling and structured outputs. Excels in ChatML format multi-turn conversations.
Hermes 2 Pro Llama 3 8B is a model in the Hermes 2 family. The structured metadata tracks a 8k-token context window. This page tracks provider routes through OctoAI API (Deprecated), Microsoft Foundry, OpenRouter, and 1 more, with the cheapest tracked route listed at $0.14 input and $0.14 output per 1M tokens. No headline benchmark score is tracked for Hermes 2 Pro Llama 3 8B yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
Compare all 4Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Novita AI | $0.140 | $0.140 | Serverless |
| OpenRouter | $0.140 | $0.140 | Serverless |
| OctoAI API (Deprecated) | $0.150 | $0.150 | Serverless |
| Microsoft Foundry | $0.370 | $1.10 | Provisioned |
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.