LLM ReferenceLLM Reference

Hermes 3

Nous ResearchOpen Source
3 models2024Up to 128K ctx

About

The Hermes 3 family of large language models (LLMs), developed by NousResearch, represents a significant advancement in generalist instruction models 146. Built upon the Llama 3.1 foundation model, Hermes 3 models are available in 8B, 70B, and 405B parameter versions 146. A key design principle is enhanced steerability, achieved through targeted training to precisely follow system and instruction prompts in a neutral and adaptive manner 146. This leads to models that are highly responsive to system prompts, allowing fine-grained control over behavior and persona 8. In addition to instruction following, Hermes 3 features long-term context retention, multi-turn conversation, complex role-playing, internal monologue abilities, and enhanced agentic function-calling 146. These models excel in structured output generation, utilizing XML tags and scratchpads for transparency and accuracy 1. The training data is a carefully curated blend of approximately 390 million tokens, including a significant portion of synthetically generated responses to encourage precise instruction following and nuanced reasoning 13. While NousResearch claims superior performance over Llama 3.1 in certain areas 46, independent evaluations indicate mixed results, underscoring the challenges in benchmarking and comparing LLMs 3.

Specifications(3 models)

Hermes 3 model specifications comparison
ModelReleasedContextParameters
Hermes 3 Llama 3.1 405B2024-11128K405B
Hermes 3 Llama 3.1 70B2024-11128K70B
Hermes 3 Llama 3.1 8B2024-11128K8B

Frequently Asked Questions

What is Hermes 3?
The Hermes 3 family of large language models (LLMs), developed by NousResearch, represents a significant advancement in generalist instruction models 146. Built upon the Llama 3.1 foundation model, Hermes 3 models are available in 8B, 70B, and 405B parameter versions 146. A key design principle is enhanced steerability, achieved through targeted training to precisely follow system and instruction prompts in a neutral and adaptive manner 146. This leads to models that are highly responsive to system prompts, allowing fine-grained control over behavior and persona 8. In addition to instruction following, Hermes 3 features long-term context retention, multi-turn conversation, complex role-playing, internal monologue abilities, and enhanced agentic function-calling 146. These models excel in structured output generation, utilizing XML tags and scratchpads for transparency and accuracy 1. The training data is a carefully curated blend of approximately 390 million tokens, including a significant portion of synthetically generated responses to encourage precise instruction following and nuanced reasoning 13. While NousResearch claims superior performance over Llama 3.1 in certain areas 46, independent evaluations indicate mixed results, underscoring the challenges in benchmarking and comparing LLMs 3.
How many models are in the Hermes 3 family?
The Hermes 3 family contains 3 models.
What is the latest Hermes 3 model?
The latest model is Hermes 3 Llama 3.1 405B, released in 2024-11.

Models(3)