Hermes 2 Pro Llama 3 8B

Name: Hermes 2 Pro Llama 3 8B
Author: Nous Research

Released

2023-12-12

Last refreshed

2026-07-11

Status

Researched 60d ago

Open sourceCommercial use: permitted

Hermes 2 Pro Llama 3 8B is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

Teams evaluating general LLM work
Workloads that can use a 8k context window
Buyers comparing 3 tracked provider routes

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Hermes 2
Released: 2023-12-12
Context: 8k
Parameters: 8B
Architecture: Decoder Only
Knowledge cutoff: 2023-12
Specialization: general
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

Nous Research

Human-centric AI model innovation

New York, New York, United States

Founded 2023

Website

Pricing

Output / 1M

$0.140

Input / 1M

$0.140

Cheapest of 4 routes · Novita AI

Providers(4)

OctoAI API (Deprecated)Microsoft Foundry OpenRouter Novita AI

View 4 provider routes

About

8B Hermes model merging Hermes 2 Pro with Llama 3 architecture for superior function calling and structured outputs. Excels in ChatML format multi-turn conversations.

Hermes 2 Pro Llama 3 8B is an open-source model in the Hermes 2 family. The structured metadata tracks a 8k-token context window. This page tracks provider routes through OctoAI API (Deprecated), Microsoft Foundry, OpenRouter, and 1 more, with the cheapest tracked route listed at $0.14 input and $0.14 output per 1M tokens. No headline benchmark score is tracked for Hermes 2 Pro Llama 3 8B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare all 4

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Novita AI	$0.140	$0.140	Serverless
OpenRouter	$0.140	$0.140	Serverless
Microsoft Foundry	$0.370	$1.10	Provisioned

Available via routers & gateways(5)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSMicrosoft Foundry

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionMicrosoft Foundry

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

PassthroughMicrosoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionMicrosoft Foundry

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

SubscriptionMicrosoft Foundry