LLM Reference

OpenChat 3.6 8B

About

OpenChat 3.6 8B is an open-source LLM built on the Llama 3 architecture, fine-tuned with offline reinforcement learning techniques to outperform other models like Llama 3 8B Instruct. Known for its competence in general conversation, coding support, and mathematical reasoning, it does face limitations with complex reasoning and occasionally generating inaccurate information. The model's deployment is optimized for consumer GPUs with 24GB RAM, supporting tensor parallelism for enhanced processing speed. It is accessible on platforms like Hugging Face, facilitating experimentation and usage.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
deepinfra API
Serverless

Specifications

Released2024-05-22
Parameters8B
Context8K
ArchitectureDecoder Only
Specializationgeneral