What is Hermes 4 used for?

Hermes 4 is used for reasoning. The family description and listed model capabilities point to those workloads as the best fit.

How does Hermes 4 compare to MOSS-Audio?

Hermes 4 by Nous Research is strongest where you need reasoning, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Hermes 4 has 2 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.

Which Hermes 4 model should I use?

Hermes-4-70B is both the lowest listed input-price option at $0.05/1M input tokens through Nous Portal and the strongest local starting point with 128k context and reasoning. Use the provider table if latency, deployment type, or output-token pricing matters more than input price.

Hermes 4 Models by Nous Research

Nous ResearchLlama 3 CommunityOpen weightsOpen Source

2 models2025Up to 128k ctxFrom $0.05/1M input

Details

ResearcherNous Research

LicenseLlama 3 Community

Commercial useCommercial use: conditional

Models2

Released2025

Max context128k

Capabilities

ReasoningAll models

Links

Website

About

The Hermes 4 family is Nous Research's open-source instruction-tuned series built on Llama 3.1 foundations, spanning 70B and 405B parameter variants with hybrid reasoning behavior and hosted availability through Nous Portal.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

2 in view

Hermes-4-70BCurrent

Use when the workload needs 128k context, 70B parameters, and reasoning.

2025-09128k context70B parametersreasoning

Hermes-4-405BCurrent

Use when the workload needs 128k context, 405B parameters, and reasoning.

2025-09128k context405B parametersreasoning

Current Hermes 4 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Hermes-4-70B	Use when the workload needs 128k context, 70B parameters, and reasoning.	2025-09	128k context70B parametersreasoning	Current
Hermes-4-405B	Use when the workload needs 128k context, 405B parameters, and reasoning.	2025-09	128k context405B parametersreasoning	Current

Release Timeline

1 release group

2025-09

2 current

Hermes-4-405B

128k context405B parametersreasoning

Current

Hermes-4-70B

128k context70B parametersreasoning

Current

Specifications(2 models)

Hermes 4 model specifications comparison
Model	Released	Context	Parameters	Reasoning
Hermes-4-70B	2025-09	128k	70B	Yes
Hermes-4-405B	2025-09	128k	405B	Yes

Available From(2 providers)

Nous Portal

OpenRouter

Pricing

Hermes 4 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Hermes-4-70B	Nous Portal	$0.05	$0.2	Serverless
Hermes-4-405B	Nous Portal	$0.09	$0.37	Serverless
Hermes-4-70B	OpenRouter	$0.13	$0.4	Serverless
Hermes-4-405B	OpenRouter	$1	$3	Serverless

Frequently Asked Questions

What is Hermes 4 used for?: Hermes 4 is used for reasoning. The family description and listed model capabilities point to those workloads as the best fit.
How does Hermes 4 compare to MOSS-Audio?: Hermes 4 by Nous Research is strongest where you need reasoning, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Hermes 4 has 2 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which Hermes 4 model should I use?: For the lowest listed input price, start with Hermes-4-70B through Nous Portal at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Hermes-4-70B with 128k context and reasoning.

Models(2)