GPT-Generated Unified Format

GGUF

Definition

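GGUF (GPT-Generated Unified Format) is a binary file format for storing LLM weights and metadata in a single file, used by GGML-based runtimes such as llama.cpp. It is a container rather than a quantization scheme: each tensor records its own data type, so the same file structure can hold full-precision weights or any of the supported quantized types, alongside key-value metadata describing the model (for example its architecture and tokenizer). Because everything needed to load the model travels in one file, a GGUF model can be distributed and loaded across different hardware configurations without separate configuration files.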