Activation-aware Weight Quantization
AWQ
Definition
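AWQ (Activation-aware Weight Quantization) is a weight-only quantization method for LLMs. It uses activation statistics from a small calibration set to identify the small fraction of weight channels that matter most for model output. Rather than keeping those channels at higher precision, it scales them up before quantization so they lose less to rounding, and applies the inverse scale to the matching activations so the layer output is mathematically unchanged before rounding. This typically yields higher accuracy than activation-agnostic round-to-nearest quantization at low bit-widths such as 4-bit.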
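The activation-aware idea in the definition can be illustrated with a small NumPy sketch. This is a simplified illustration, not the reference AWQ implementation: the function names (quantize_rtn, awq_style_quantize, search_alpha), the per-row symmetric quantizer, the scale normalization, and the 11-point search grid are all assumptions made for this example, and real implementations typically add group-wise quantization and weight clipping.

```python
import numpy as np


def quantize_rtn(w, n_bits=4):
    """Symmetric round-to-nearest quantization per output row (quantize, then dequantize)."""
    qmax = 2 ** (n_bits - 1) - 1
    step = np.abs(w).max(axis=1, keepdims=True) / qmax
    step = np.where(step == 0, 1.0, step)  # avoid division by zero for all-zero rows
    return np.clip(np.round(w / step), -qmax - 1, qmax) * step


def awq_style_quantize(w, x, n_bits=4, alpha=0.5):
    """Scale input channels by mean |activation| ** alpha before quantizing.

    w: weights of shape (out_features, in_features)
    x: calibration activations of shape (samples, in_features)
    Returns the effective dequantized weight with the inverse scale folded back in,
    which is equivalent to dividing the activations by the scale at run time.
    """
    act_mag = np.abs(x).mean(axis=0)              # per-input-channel activation magnitude
    s = np.maximum(act_mag, 1e-8) ** alpha        # larger scale for salient channels
    s = s / np.sqrt(s.max() * s.min())            # keep scales centered around 1 (assumption)
    w_q = quantize_rtn(w * s, n_bits)             # salient columns are scaled up before rounding
    return w_q / s


def search_alpha(w, x, n_bits=4, grid=None):
    """Pick the alpha that minimizes output error on the calibration data."""
    if grid is None:
        grid = np.linspace(0.0, 1.0, 11)          # alpha = 0 reduces to plain round-to-nearest
    y_ref = x @ w.T
    best_alpha, best_err, best_w = None, np.inf, None
    for alpha in grid:
        w_hat = awq_style_quantize(w, x, n_bits, alpha)
        err = float(np.mean((x @ w_hat.T - y_ref) ** 2))
        if err < best_err:
            best_alpha, best_err, best_w = float(alpha), err, w_hat
    return best_alpha, best_err, best_w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(256, 64))
    x[:, :2] *= 30.0                              # two input channels with large activations
    w = rng.normal(size=(32, 64))
    rtn_err = float(np.mean((x @ quantize_rtn(w).T - x @ w.T) ** 2))
    alpha, awq_err, _ = search_alpha(w, x)
    print(f"round-to-nearest error: {rtn_err:.4f}")
    print(f"activation-aware error: {awq_err:.4f} (alpha={alpha:.1f})")
```

Because the grid includes alpha = 0, which leaves the weights unscaled, the searched result is never worse than plain round-to-nearest on the calibration data. How much it improves depends on how uneven the activation magnitudes are across channels.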