LLM Reference

Hermes 2 Models by Nous Research

Nous ResearchOpen Source
14 models2023Up to 200k ctxFrom $0.05/1M input

About

The Hermes 2 family of large language models, crafted by Nous Research, represents a suite of advanced models derived from diverse base architectures and refined for exceptional capabilities. Known for their superior performance across a wide spectrum of both general tasks and conversations, these models shine particularly in function calling and generating structured JSON outputs 23. Hermes 2 incorporates ChatML, facilitating structured multi-turn dialogues and specialized prompts tailored for dependable function calling and JSON output. Comprising models built on Llama 3 and Mistral architectures, they offer varying parameter configurations that cater to diverse performance needs 2. These models are evaluated highly in accuracy for function calling and JSON generation, making them apt for applications where reliable and interpretable machine-generated responses are crucial 4.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

14 in view

Use when the workload needs 8k context and 8B parameters.

2023-128k context8B parameters

Use when the workload needs 8k context and 8B parameters.

2023-128k context8B parameters

Use when the workload needs 32k context, 7B parameters, and function calling.

2023-1232k context7B parametersfunction calling

Use when the workload needs 32k context.

2023-1232k context

Use when the workload needs 32k context and 7B parameters.

2023-1232k context7B parameters

Use when the workload needs 200k context and 34B parameters.

2023-12200k context34B parameters

Use when the workload needs 4k context and 10.7B parameters.

2023-124k context10.7B parameters

Use when the workload needs 8k context and 70B parameters.

2023-128k context70B parameters

Use when the workload needs 8k context and 70B parameters.

2023-128k context70B parameters

Use when the workload needs 4k context and 70B parameters.

2023-124k context70B parameters

Use when the workload needs 4k context and 13B parameters.

2023-124k context13B parameters

Use when the workload needs 4k context and 7B parameters.

2023-124k context7B parameters

Use when the workload needs 33k context and 56B parameters.

2023-1233k context56B parameters

Use when the workload needs 33k context and 56B parameters.

2023-1233k context56B parameters

Release Timeline

1 release group
2023-12
14 current
Hermes 2 Pro Llama 3 70B
8k context70B parameters
Current
Hermes 2 Pro Llama 3 8B
8k context8B parameters
Current
Hermes 2 Pro Mistral 7B
32k context7B parametersfunction calling
Current
Hermes 2 Theta Llama 3 70B
8k context70B parameters
Current
Hermes 2 Theta Llama 3 8B
8k context8B parameters
Current
Nous Hermes 2 Llama 2 13B
4k context13B parameters
Current
Nous Hermes 2 Llama 2 70B
4k context70B parameters
Current
Nous Hermes 2 Llama 2 7B
4k context7B parameters
Current
Nous Hermes 2 Mistral 7B
32k context7B parameters
Current
Current
Nous Hermes 2 SOLAR 10.7B
4k context10.7B parameters
Current
Nous Hermes 2 Yi 34B
200k context34B parameters
Current
OctoML Nous-Hermes-2-Mixtral-8x7B-DPO
33k context56B parameters
Current
Current

Specifications(14 models)

Hermes 2 model specifications comparison
ModelReleasedContextParametersFn Calling
Hermes 2 Pro Llama 3 8B2023-128k8BNo
Hermes 2 Theta Llama 3 8B2023-128k8BNo
Hermes 2 Pro Mistral 7B2023-1232k7BYes
Nous Hermes 2 Mixtral 8x7B2023-1232k8x7BNo
Nous Hermes 2 Mistral 7B2023-1232k7BNo
Nous Hermes 2 Yi 34B2023-12200k34BNo
Nous Hermes 2 SOLAR 10.7B2023-124k10.7BNo
Hermes 2 Theta Llama 3 70B2023-128k70BNo
Hermes 2 Pro Llama 3 70B2023-128k70BNo
Nous Hermes 2 Llama 2 70B2023-124k70BNo
Nous Hermes 2 Llama 2 13B2023-124k13BNo
Nous Hermes 2 Llama 2 7B2023-124k7BNo
Together AI Nous-Hermes-2-Mixtral-8x7B-DPO2023-1233k56BNo
OctoML Nous-Hermes-2-Mixtral-8x7B-DPO2023-1233k56BNo

Available From(9 providers)

Frequently Asked Questions

What is Hermes 2 used for?
Hermes 2 is used for agent workflows and tool use and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Hermes 2 compare to MOSS-Audio?
Hermes 2 by Nous Research is strongest where you need agent workflows and tool use, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Hermes 2 has 14 listed variants and reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Hermes 2 model should I use?
For the lowest listed input price, start with Hermes 2 Theta Llama 3 8B through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Hermes 2 Pro Mistral 7B with 32k context and function calling.

Models(14)