Nous Llama 3 Models by Nous Research
About
The Nous Llama 3 large language model (LLM) family is built on Meta's Llama 3 architecture and fine-tuned by Nous Research. It spans several sizes, including 8B and 70B, and a 405B-parameter model named Hermes 3, which is noted for its reasoning and instruction-following abilities. The 8B model, available on Hugging Face, is tuned for dialogue and outperforms many open-source chat models on traditional benchmarks. The family targets applications ranging from chatbots and AI writing assistants to AI agents, and is available on platforms such as Ollama and Hugging Face.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Nous Llama 3 70B | Use when the workload needs 8k context and 70B parameters. | 2024-04 | 8k context, 70B parameters | Current |
| Nous Llama 3 8B | Use when the workload needs 8k context and 8B parameters. | 2024-04 | 8k context, 8B parameters | Current |
Release Timeline
1 release group
Specifications (2 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Nous Llama 3 70B | 2024-04 | 8k | 70B |
| Nous Llama 3 8B | 2024-04 | 8k | 8B |
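Both listed variants share an 8k context window, so a prompt and its generated reply have to fit in that budget together. The sketch below shows one way to check this before sending a request. It assumes "8k" means 8,192 tokens (the table only says "8k") and that token counts are already known from whatever tokenizer is in use; the function names are hypothetical and not part of any Nous Research or provider API.

```python
# Assumption: the "8k" context listed in the table means 8,192 tokens.
CONTEXT_WINDOW = 8192


def fits_in_context(prompt_tokens: int, max_new_tokens: int,
                    context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the prompt plus the requested output fits in the window."""
    return prompt_tokens + max_new_tokens <= context_window


def max_new_tokens_allowed(prompt_tokens: int,
                           context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    return max(0, context_window - prompt_tokens)


if __name__ == "__main__":
    # Hypothetical request: a 7,000-token prompt asking for 1,000 new tokens.
    print(fits_in_context(7000, 1000))
    print(max_new_tokens_allowed(8000))
```

If a workload regularly exceeds this budget, the prompt needs to be shortened or chunked, or a family with a longer context window is the better fit.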
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M tokens | Output / 1M tokens | Type |
|---|---|---|---|---|
| Nous Llama 3 8B | Microsoft Foundry | $0.37 | $1.10 | Provisioned |
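To turn the listed rates into a rough budget figure, the arithmetic is tokens times price per million, summed over input and output. The sketch below uses the two prices from the table and assumes simple per-token billing; the "Provisioned" type may be billed differently in practice, so treat the result as an estimate only. The function name and the example workload are hypothetical.

```python
# Listed prices for Nous Llama 3 8B on Microsoft Foundry, in USD per 1M tokens.
INPUT_PRICE_PER_MILLION = 0.37
OUTPUT_PRICE_PER_MILLION = 1.10


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float = INPUT_PRICE_PER_MILLION,
                  output_price: float = OUTPUT_PRICE_PER_MILLION) -> float:
    """Return the estimated USD cost, assuming plain per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    print(f"${estimate_cost(2_000_000, 500_000):.2f}")  # prints $1.29
```

Output tokens cost roughly three times as much as input tokens at these rates, so response length tends to dominate the bill for chat-style workloads.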
Frequently Asked Questions
- What is Nous Llama 3 used for?
  - Nous Llama 3 is used for agent workflows, chatbots, and role-playing. The family description and listed model capabilities point to those workloads as the best fit.
- How does Nous Llama 3 compare to Hermes 2?
  - Both families come from Nous Research. Nous Llama 3 is strongest for agent workflows, and Hermes 2 is the closest related family to check for agent workflows and tool use. Nous Llama 3 has 2 listed variants and reaches up to 8k context, while Hermes 2 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
- Which Nous Llama 3 model should I use?
  - For the lowest listed input price, start with Nous Llama 3 8B through Microsoft Foundry at $0.37 per 1M input tokens. For the most capable listed variant to run locally, evaluate Nous Llama 3 70B with 8k context.






