supervised fine-tuning

SFT


Definition

Supervised fine-tuning (SFT) adapts a pretrained model by training it further on labeled instruction-response pairs, improving task-specific behavior such as following instructions. It aligns a general-purpose model with user needs using far less data than pretraining requires, and it typically precedes RLHF in a post-training pipeline. SFT improves instruction adherence and can reduce hallucinations.
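As a rough illustration of training on instruction-response pairs, the sketch below builds one example and computes a next-token cross-entropy loss over the response tokens only. It assumes a common convention rather than any specific model's recipe: prompt positions are masked out of the loss with a label of -100 (the default ignore index of PyTorch's cross-entropy loss). The helper names `build_example` and `masked_cross_entropy`, the toy token ids, and the four-token vocabulary are hypothetical.

```python
import math

IGNORE_INDEX = -100  # assumed convention: label meaning "no loss at this position"


def build_example(prompt_ids, response_ids):
    """Concatenate prompt and response; mask the prompt positions in the labels."""
    input_ids = list(prompt_ids) + list(response_ids)
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)
    return input_ids, labels


def masked_cross_entropy(logits, labels):
    """Mean negative log-likelihood over unmasked (response) positions.

    logits[t] holds the scores for the token at position t + 1,
    so the labels are shifted by one before comparison.
    """
    total, count = 0.0, 0
    for scores, target in zip(logits[:-1], labels[1:]):
        if target == IGNORE_INDEX:
            continue  # prompt token: excluded from the loss
        peak = max(scores)
        # Numerically stable log-sum-exp over the vocabulary.
        log_z = peak + math.log(sum(math.exp(s - peak) for s in scores))
        total += log_z - scores[target]
        count += 1
    return total / count if count else 0.0


# Toy pair: prompt tokens [0, 1], response tokens [2, 3], vocabulary of size 4.
input_ids, labels = build_example([0, 1], [2, 3])
uniform_logits = [[0.0] * 4 for _ in input_ids]  # stand-in for model output
loss = masked_cross_entropy(uniform_logits, labels)
```

With uniform scores over four tokens, the loss equals log 4, the value an untrained model would give. Fine-tuning lowers this quantity on the response tokens. Some recipes also train on the prompt tokens, so the masking here is a design choice, not a requirement.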

Models mentioning supervised fine-tuning (12)

Llama 3.1 405B Instruct (2024-07)
Llama 3.1 70B Instruct (2024-07)
Llama 3.1 8B Instruct (2024-07)
Llama 3.1 405B (2024-07)
Llama 3.1 8B (2024-07)
StarChat2 15B (2024-07)
SeaLLM 7B V2.5 (2024-07)
GLM-4-Extreme (2024-06)
Qwen2-0.5B (2024-06)
InternLM2 Math Plus 20B (2024-05)
Phi-3 Medium 128K (2024-05)
Phi-3 Medium 4K (2024-05)