LLM Reference

The data & SQL leaderboard · for knowledge workers

Best for data & SQL

4 editor picks · 6 eligible models · Reads schemas, writes queries that actually run.

EDITOR'S CHOICE · Researched 3d ago

GPT-5.5

OpenAI · 1M context
Excellent

Reads schemas and writes queries that actually run.

Best text-to-SQL accuracy in production: a HumanEval score of 94.2 and top-tier reasoning help it pick the right join and respect dialect quirks.

The numbers
$/1M out: $30.00 ($5.00 input)
Context: 1M (max window)
Pros
  • Top SQL accuracy & reasoning
  • Good dialect awareness
  • 1M context for big schemas
Cons
  • $30.00 per 1M output tokens; drop to DeepSeek V4 Pro for high volume
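To make the cost trade-off concrete, here is a minimal sketch of the per-request arithmetic, using the $5.00 input and $30.00 output prices listed for GPT-5.5. The function name and the workload figures (token counts, monthly volume) are hypothetical and chosen only for illustration.

```python
# Prices in USD per 1M tokens, taken from the GPT-5.5 card above.
INPUT_PRICE_PER_M = 5.00
OUTPUT_PRICE_PER_M = 30.00


def query_cost(input_tokens: int, output_tokens: int,
               input_price: float = INPUT_PRICE_PER_M,
               output_price: float = OUTPUT_PRICE_PER_M) -> float:
    """Return the USD cost of one request at per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: a 20,000-token schema prompt producing a
# 500-token query, run 10,000 times a month.
per_query = query_cost(20_000, 500)
monthly = per_query * 10_000
print(f"per query: ${per_query:.4f}, monthly: ${monthly:,.2f}")
```

Passing a different model's prices as `input_price` and `output_price` gives a like-for-like comparison on the same workload.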

Also worth picking

The runners-up

ranked by editorial pick order
Editorial tiers: Excellent · Strong · Solid
# | Model | Tier | $/1M out | Editor's note
#2 | Anthropic · 1M | | $15.00 | Best at reasoning over messy schemas with hundreds of tables.
#3 | DeepSeek · 1M | | $0.87 | LiveCodeBench leader at $0.87 out; excellent SQL value, open weights.
#4 | Google DeepMind · 1M | | $5.00 | A 1M window fits the whole schema for one-shot exploration.

Eligibility

6 models are eligible for this board

A model is eligible if it is tagged with useCases: [data-sql]. Pinned picks must come from this pool.
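The eligibility rule can be sketched as a tag filter followed by a pin check. This is only an illustration of the rule as stated: the `Model` class, the function names, and the model names are hypothetical, and the site's actual data model is not shown in the source.

```python
from dataclasses import dataclass, field


@dataclass
class Model:
    name: str
    use_cases: list = field(default_factory=list)  # e.g. ["data-sql"]


def eligible_pool(models, tag="data-sql"):
    """Return the models tagged with the given use case."""
    return [m for m in models if tag in m.use_cases]


def invalid_pins(pins, pool):
    """Return pinned names that are not in the eligible pool."""
    pool_names = {m.name for m in pool}
    return [p for p in pins if p not in pool_names]


# Hypothetical catalogue: two models carry the data-sql tag, one does not.
catalogue = [
    Model("model-a", ["data-sql", "coding"]),
    Model("model-b", ["writing"]),
    Model("model-c", ["data-sql"]),
]
pool = eligible_pool(catalogue)
bad = invalid_pins(["model-a", "model-b"], pool)
print([m.name for m in pool], bad)
```

Any name returned by `invalid_pins` is a pin that would violate the rule that pins must come from the eligible pool.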
