LLM ReferenceLLM Reference

Hermes 2 Theta Llama 3 70B

About

The Hermes-2 Theta Llama-3 70B is an advanced large language model created through the fusion of Nous Research's Hermes 2 Pro and Meta's Llama-3 Instruct models. Enhanced by Reinforcement Learning from Human Feedback (RLHF), it contains 70 billion parameters and utilizes a diverse dataset comprising web content, scientific literature, and synthetic data. With a knowledge cutoff in early 2024, the model excels in generating structured outputs, executing function calls, and handling multi-turn conversations. It showcases strong performance in various benchmarks, often surpassing GPT-4 and similar models. Despite its capabilities, it has limitations like complex prompt formats, substantial VRAM requirements, and dependency on user input quality.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Rankings

Specifications

FamilyHermes 2
Released2023-12-12
Parameters70B
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Human-centric AI model innovation

New York, New York, United States
Founded 2023
Website