LLM Reference

OpenHermes 13B

About

OpenHermes-13B is a large language model (LLM) fine-tuned by Teknium on a dataset primarily containing GPT-4-generated data. Built on the Llama-2-13b-hf base model, it comprises 13 billion parameters and is trained on an open-source dataset with 242,000 entries. The training data includes contributions from sources like GPTeacher, WizardLM, and Airoboros GPT-4. Key features include filtering OpenAI refusals and a context length of 4096 tokens. While demonstrating strong text generation capabilities, especially in benchmarks like GPT4All, it shows limitations in reasoning and logic, as seen in the AGIEval score. Training details are accessible through WandB, highlighting its transparent development process.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Specifications

Parameters13B
ArchitectureDecoder Only
Specializationgeneral