The summarization leaderboard · for knowledge workers

Best for summarization

4 editor picks · 8 eligible models · Faithful, structured, no hallucinated bullets.

Editorial pick plus benchmark and API pricing context.

EDITOR'S CHOICE · Researched 50d ago

Gemini 3 Flash

Google DeepMind · 1M context

Excellent

Faithful, structured, cheap — and it won't invent bullets.

1M context plus MMLU-Pro 88.6 at $3 per 1M output tokens: handles a 500-page transcript faithfully without breaking the budget.

The numbers

$/1M out

$3.00

$0.50 input

Context

max window
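
As a rough illustration of what the listed prices mean for a whole-document summary, here is a minimal sketch of the arithmetic. The $0.50 input and $3.00 output prices per 1M tokens come from the card above; the tokens-per-page figure and the summary length are assumptions chosen for illustration, not measurements, and `summary_cost_usd` is a hypothetical helper name.

```python
def summary_cost_usd(input_tokens, output_tokens,
                     input_price_per_m=0.50, output_price_per_m=3.00):
    """Estimate the API cost in USD for one summarization call.

    Prices are per 1M tokens; the defaults are the figures listed on the card.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Assumptions (not from the leaderboard): about 600 tokens per page
# and a 2,000-token summary.
pages = 500
tokens_per_page = 600
summary_tokens = 2_000

cost = summary_cost_usd(pages * tokens_per_page, summary_tokens)
print(f"Estimated cost: ${cost:.3f}")
```

Under these assumptions the input side dominates the bill, so for long transcripts the input price can matter more than the headline $/1M out figure.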

Pros

+1M context for whole-doc summaries
+Faithful, well-structured
+$3 / 1M out

Cons

−Less nuanced than Opus on subtext

Also worth picking

The runners-up

Ranked by editorial pick order · Editorial tiers: Excellent, Strong, Solid

# · Model · Tier · $/1M out · Editor's note

Claude Haiku 4.5

Anthropic · 200k

$4.00 / 1M out

Best faithfulness-per-dollar in the Claude line; doesn't hallucinate bullet points.

DeepSeek V4 Flash

DeepSeek · 1M

$0.18 / 1M out

$0.22 out with a 1M window — the high-volume async summarizer.

Claude Sonnet 4.6

Anthropic · 1M

$15.00 / 1M out

Highest-fidelity option when the summary is going in front of a customer.

Eligibility

8 models are eligible for this board

Eligibility means tagged with useCases: [summarization]. Pins must come from this pool.
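
The eligibility rule can be sketched as a filter plus a pin check. This is a minimal sketch, not the site's actual implementation: only the `useCases` tag and the `summarization` value come from the text above, while the record shape, the `id` field, the sample model ids, and the function names are hypothetical.

```python
def eligible_pool(models, use_case="summarization"):
    """Return the models tagged with the given use case."""
    return [m for m in models if use_case in m.get("useCases", [])]


def invalid_pins(pins, models, use_case="summarization"):
    """Return the pinned ids that are NOT in the eligible pool."""
    pool_ids = {m["id"] for m in eligible_pool(models, use_case)}
    return [p for p in pins if p not in pool_ids]


# Hypothetical catalog records; ids and tags are illustrative only.
catalog = [
    {"id": "model-a", "useCases": ["summarization", "chat"]},
    {"id": "model-b", "useCases": ["coding"]},
    {"id": "model-c", "useCases": ["summarization"]},
    {"id": "model-d"},  # no tags at all, so never eligible
]

pool = eligible_pool(catalog)
bad = invalid_pins(["model-a", "model-b"], catalog)
print([m["id"] for m in pool])  # ids eligible for the board
print(bad)                      # pins that violate the rule
```

Returning the offending pins, rather than a plain boolean, makes it easier to report exactly which editorial pick falls outside the pool.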