alignment

Definition

Alignment ensures LLMs produce outputs matching human values, preferences, and safety constraints through techniques like RLHF, DPO, or constitutional AI. It addresses the gap between raw predictive power and deployable utility by iteratively refining behaviors via feedback, reducing harms like bias.

Models Using alignment(12)

Granite 4.1 3B2026-04 Nanbeige4.1-3B2026-02 Doubao 1.5 Pro Vision 32K2025-01 Swallow 13B Instruct2024-12 Stable LM 2.5 1.6B Instruct2024-11 Swallow 7B Instruct2024-09 Saul 141B Instruct2024-07 Saul 54B Instruct2024-07 Dolphin 2.9.2 Qwen2-72B2024-05 InternLM XComposer2 4KHD 7B2024-04 XuanYuan 13B2024-02 InternLM2 1.8B2024-01