What is Hermes 3 used for?

Hermes 3 is used for agent workflows, structured outputs, and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.

How does Hermes 3 compare to MOSS-Audio?

Hermes 3 by Nous Research is strongest where you need agent workflows, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Hermes 3 has 3 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.

Which Hermes 3 model should I use?

For the lowest listed input price, start with Hermes 3 Llama 3.1 70B through OpenRouter at $0.7/1M input tokens. For the most capable/latest local choice, evaluate Hermes 3 Llama 3.1 405B with 128k context.

Hermes 3 Models by Nous Research

Nous ResearchLlama 3 CommunityOpen weightsOpen Source

3 models2024Up to 128k ctxFrom $0.7/1M input

Details

ResearcherNous Research

LicenseLlama 3 Community

Commercial useCommercial use: conditional

Models3

Released2024

Max context128k

Links

Website HuggingFace

About

The Hermes 3 family of large language models (LLMs), developed by NousResearch, represents a significant advancement in generalist instruction models 146. Built upon the Llama 3.1 foundation model, Hermes 3 models are available in 8B, 70B, and 405B parameter versions 146. A key design principle is enhanced steerability, achieved through targeted training to precisely follow system and instruction prompts in a neutral and adaptive manner 146. This leads to models that are highly responsive to system prompts, allowing fine-grained control over behavior and persona 8. In addition to instruction following, Hermes 3 features long-term context retention, multi-turn conversation, complex role-playing, internal monologue abilities, and enhanced agentic function-calling 146. These models excel in structured output generation, utilizing XML tags and scratchpads for transparency and accuracy 1. The training data is a carefully curated blend of approximately 390 million tokens, including a significant portion of synthetically generated responses to encourage precise instruction following and nuanced reasoning 13. While NousResearch claims superior performance over Llama 3.1 in certain areas 46, independent evaluations indicate mixed results, underscoring the challenges in benchmarking and comparing LLMs 3.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

3 in view

Hermes 3 Llama 3.1 405BCurrent

Use when the workload needs 128k context and 405B parameters.

2024-11128k context405B parameters

Hermes 3 Llama 3.1 70BCurrent

Use when the workload needs 128k context and 70B parameters.

2024-11128k context70B parameters

Hermes 3 Llama 3.1 8BCurrent

Use when the workload needs 128k context and 8B parameters.

2024-11128k context8B parameters

Current Hermes 3 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Hermes 3 Llama 3.1 405B	Use when the workload needs 128k context and 405B parameters.	2024-11	128k context405B parameters	Current
Hermes 3 Llama 3.1 70B	Use when the workload needs 128k context and 70B parameters.	2024-11	128k context70B parameters	Current
Hermes 3 Llama 3.1 8B	Use when the workload needs 128k context and 8B parameters.	2024-11	128k context8B parameters	Current

Release Timeline

1 release group

2024-11

3 current

Hermes 3 Llama 3.1 405B

128k context405B parameters

Current

Hermes 3 Llama 3.1 70B

128k context70B parameters

Current

Hermes 3 Llama 3.1 8B

128k context8B parameters

Current

Specifications(3 models)

Hermes 3 model specifications comparison
Model	Released	Context	Parameters
Hermes 3 Llama 3.1 405B	2024-11	128k	405B
Hermes 3 Llama 3.1 70B	2024-11	128k	70B
Hermes 3 Llama 3.1 8B	2024-11	128k	8B

Available From(1 provider)

OpenRouter

Pricing

Hermes 3 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Hermes 3 Llama 3.1 70B	OpenRouter	$0.7	$0.7	Serverless
Hermes 3 Llama 3.1 405B	OpenRouter	$1	$1	Serverless

Frequently Asked Questions

What is Hermes 3 used for?: Hermes 3 is used for agent workflows, structured outputs, and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does Hermes 3 compare to MOSS-Audio?: Hermes 3 by Nous Research is strongest where you need agent workflows, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Hermes 3 has 3 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which Hermes 3 model should I use?: For the lowest listed input price, start with Hermes 3 Llama 3.1 70B through OpenRouter at $0.7/1M input tokens. For the most capable/latest local choice, evaluate Hermes 3 Llama 3.1 405B with 128k context.