LLM Reference
Replicate API

Nous Hermes 2 Yi 34B on Replicate API

Hermes 2 · Nous Research

Serverless

Last refreshed 2026-04-19. Next refresh: weekly.

Why use Nous Hermes 2 Yi 34B on Replicate API?

Replicate API offers Nous Hermes 2 Yi 34B with pay-as-you-go pricing at $0.20 per 1M input tokens and $1.00 per 1M output tokens. Replicate is a cloud platform for running machine learning models through an API.

Compare Nous Hermes 2 Yi 34B across 3 providers to find the best fit for your use case
Input / 1M: $0.20
Output / 1M: $1.00
Cache: Not sourced
Batch: Not sourced

Setup recipe

Python + curl
Install
pip install replicate
Auth
export REPLICATE_API_TOKEN=...
Call
import replicate
output = replicate.run(
    "nous-hermes2-yi-34b",
    input={"prompt": "Hello"}
)
Model ID
nous-hermes2-yi-34b

Request example

import replicate

# reads REPLICATE_API_TOKEN from env
# Replicate model references use "owner/model-name" (latest version) or "owner/model-name:version-hash" (pinned version)
output = replicate.run(
    "nous-hermes2-yi-34b",
    input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))
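Because the output can be a list or a generator depending on the model, a small helper can normalize it before printing. The following is a minimal sketch under stated assumptions: `collect_text` is a hypothetical helper name, not part of the replicate package, and it assumes the output is either a plain string or an iterable of text chunks.

```python
from collections.abc import Iterable


def collect_text(output) -> str:
    """Join model output into a single string.

    Hypothetical helper (not part of the replicate package). Assumes the
    output is a plain string or an iterable (list or generator) whose
    items convert to text with str().
    """
    if isinstance(output, str):
        return output
    if isinstance(output, Iterable):
        return "".join(str(chunk) for chunk in output)
    raise TypeError(f"unsupported output type: {type(output).__name__}")


# The same call handles a list and a generator of chunks.
print(collect_text(["Hel", "lo"]))
print(collect_text(chunk for chunk in ("Hel", "lo")))
```

In the request example, this would replace the direct `"".join(output)` call, for instance `print(collect_text(output))`.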

Gotchas

  • Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
  • The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.
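The model reference format in the first gotcha can be checked before any request is sent. The sketch below is illustrative only: `ModelRef`, `parse_model_ref`, and `predictions_path` are hypothetical names, and the path template is copied from the gotcha above rather than verified against a live endpoint.

```python
from typing import NamedTuple, Optional


class ModelRef(NamedTuple):
    owner: str
    name: str
    version: Optional[str]  # None means "latest version"


def parse_model_ref(ref: str) -> ModelRef:
    """Split "owner/model-name" or "owner/model-name:version" into parts."""
    name_part, _, version = ref.partition(":")
    owner, sep, name = name_part.partition("/")
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f'expected "owner/model-name", got {ref!r}')
    return ModelRef(owner, name, version or None)


def predictions_path(ref: ModelRef) -> str:
    """Fill in the path template quoted in the gotcha above."""
    return f"/v1/models/{ref.owner}/{ref.name}/predictions"


ref = parse_model_ref("meta/meta-llama-3-8b-instruct")
print(predictions_path(ref))

# A bare ID with no owner prefix fails the format check.
try:
    parse_model_ref("nous-hermes2-yi-34b")
except ValueError as err:
    print(err)
```

Validating the reference up front turns a missing owner prefix into a local error instead of a failed API call.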

Compare Nous Hermes 2 Yi 34B Across Providers

Provider        Input (per 1M)   Output (per 1M)
Together AI     $0.80            $0.80
Fireworks AI    $0.90            $0.90
Replicate API   $0.20            $1.00
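Which provider is cheapest depends on the ratio of input to output tokens, since Replicate API has the lowest input price but the highest output price. The sketch below works through that arithmetic using the prices from the table above; the function names and the two workloads are hypothetical and chosen only for illustration.

```python
# USD per 1M tokens (input, output), copied from the comparison table.
PRICES = {
    "Together AI": (0.80, 0.80),
    "Fireworks AI": (0.90, 0.90),
    "Replicate API": (0.20, 1.00),
}


def cost_usd(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Cost of one workload at the listed per-1M-token prices."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Provider with the lowest cost for the given token counts."""
    return min(PRICES, key=lambda p: cost_usd(p, input_tokens, output_tokens))


# Hypothetical workloads: balanced, then output-heavy.
print(cheapest(1_000_000, 1_000_000))
print(cheapest(1_000_000, 4_000_000))
```

From the listed prices, Replicate API costs less than Together AI whenever 0.20·I + 1.00·O < 0.80·I + 0.80·O, that is, when output tokens O are fewer than three times input tokens I.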

Pricing

Type            Price (per 1M)
Input tokens    $0.20
Output tokens   $1.00

Capabilities

No model capability flags are currently sourced.

About Nous Hermes 2 Yi 34B

Nous Hermes 2 Yi 34B is a 34B-parameter Hermes variant built on Yi 34B, trained on diverse, high-quality datasets to improve reasoning and problem-solving.

FAQ

What does Nous Hermes 2 Yi 34B cost on Replicate API?

On Replicate API, Nous Hermes 2 Yi 34B costs $0.20 per 1M input tokens and $1.00 per 1M output tokens.

What is the context window for Nous Hermes 2 Yi 34B on Replicate API?

Nous Hermes 2 Yi 34B supports a 200,000 token context window on Replicate API.

How does Replicate API compare to other Nous Hermes 2 Yi 34B providers?

Nous Hermes 2 Yi 34B is available from 3 providers. Replicate API has the lowest input price at $0.20/1M tokens, but its $1.00/1M output price is the highest of the three.

Who created Nous Hermes 2 Yi 34B?

Nous Hermes 2 Yi 34B was created by Nous Research as part of the Hermes 2 model family.

Is Nous Hermes 2 Yi 34B open source?

The open source status of Nous Hermes 2 Yi 34B is not currently sourced.

Model Specs

Released: 2023-12-12
Parameters: 34B
Context: 200K
Architecture: Decoder Only
Knowledge cutoff: 2024-03