LLM Reference

Nous Llama 3 Models by Nous Research

2 models2024Up to 8k ctxFrom $0.37/1M input

About

The Nous Llama 3 large language model (LLM) family is crafted from Meta's Llama 3 architecture and fine-tuned by Nous Research. It includes models of varying sizes, such as the 8B, 70B, and an impressive 405B parameter model named Hermes 3. Hermes 3 is particularly notable for its superior reasoning abilities and adeptness in following instructions. The 8B parameter model, accessible on Hugging Face, excels in dialogue, outperforming many open-source chat models on traditional benchmarks. The Nous Llama 3 models cater to diverse applications, ranging from sophisticated chatbots and AI writing assistants to advanced AI agents, and are conveniently available on platforms like Ollama and Hugging Face 35.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view

Use when the workload needs 8k context and 70B parameters.

2024-048k context70B parameters

Use when the workload needs 8k context and 8B parameters.

2024-048k context8B parameters

Release Timeline

1 release group
2024-04
2 current
Nous Llama 3 70B
8k context70B parameters
Current
Nous Llama 3 8B
8k context8B parameters
Current

Specifications(2 models)

Nous Llama 3 model specifications comparison
ModelReleasedContextParameters
Nous Llama 3 70B2024-048k70B
Nous Llama 3 8B2024-048k8B

Available From(1 provider)

Pricing

Nous Llama 3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Nous Llama 3 8BMicrosoft Foundry$0.37$1.1Provisioned

Frequently Asked Questions

What is Nous Llama 3 used for?
Nous Llama 3 is used for agent workflows and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does Nous Llama 3 compare to Hermes 2?
Nous Llama 3 by Nous Research is strongest where you need agent workflows, while Hermes 2 by Nous Research is the closest related family to check for agent workflows and tool use. Nous Llama 3 has 2 listed variants and reaches up to 8k context, while Hermes 2 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Nous Llama 3 model should I use?
For the lowest listed input price, start with Nous Llama 3 8B through Microsoft Foundry at $0.37/1M input tokens. For the most capable/latest local choice, evaluate Nous Llama 3 70B with 8k context.

Models(2)