LLM Reference
Status: active

LLM Judge

Metric: Judge Score (higher is better)
Introduced: 2023

About

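LLM Judge is the LLM-as-a-judge evaluation paradigm: a strong model (such as GPT-4) scores the responses of other models, which makes it possible to assess open-ended generation quality at scale without relying on human annotators for every response.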
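
The scoring loop behind this paradigm can be sketched roughly as follows. This is a minimal illustration, not the method of any specific benchmark: the prompt wording, the 1 to 10 scale, the `Rating: <number>` reply format, and the names `JUDGE_PROMPT`, `parse_rating`, `judge_score`, and `stub_judge` are all assumptions made for the example. The judge is passed in as a plain callable, so a real model client could be substituted for the stub.

```python
import re
from statistics import mean
from typing import Callable, Iterable, Optional, Tuple

# Hypothetical prompt template; the 1-10 scale is an assumption, not from the source.
JUDGE_PROMPT = (
    "You are an impartial judge. Rate the response to the question below "
    "on a scale from 1 to 10.\n"
    "Question: {question}\n"
    "Response: {response}\n"
    "Reply in the form 'Rating: <number>'."
)


def parse_rating(reply: str) -> Optional[float]:
    """Extract the numeric rating from a judge reply, or None if it is missing or out of range."""
    match = re.search(r"Rating:\s*(\d+(?:\.\d+)?)", reply)
    if match is None:
        return None
    rating = float(match.group(1))
    return rating if 1 <= rating <= 10 else None


def judge_score(
    pairs: Iterable[Tuple[str, str]],
    judge: Callable[[str], str],
) -> float:
    """Average the judge's ratings over (question, response) pairs, skipping unparseable replies."""
    ratings = []
    for question, response in pairs:
        reply = judge(JUDGE_PROMPT.format(question=question, response=response))
        rating = parse_rating(reply)
        if rating is not None:
            ratings.append(rating)
    if not ratings:
        raise ValueError("judge returned no parseable ratings")
    return mean(ratings)


def stub_judge(prompt: str) -> str:
    """Stand-in for a real judge model call (hypothetical, for demonstration only)."""
    return "Rating: 8" if "Response: Paris" in prompt else "Rating: 3"


if __name__ == "__main__":
    examples = [
        ("What is the capital of France?", "Paris"),
        ("What is the capital of France?", "Lyon"),
    ]
    print(judge_score(examples, stub_judge))
```

Keeping the judge behind a callable makes the aggregation logic testable without network access. Replies that cannot be parsed are dropped here; a stricter setup might retry or count them as failures instead.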