llmreference
Decision hubLive seed shortlist

Choosing an LLM API for your app. Start here.

Last refreshed 2026-05-11. Next refresh: weekly.

Pick the integration path by priority first, then drill into model, host, price, benchmark, and freshness evidence before you commit code.

Decision strip

Start with the constraint that will break the app.

Live shortlist

Three API routes worth opening first.

Open full API leaderboard

Frontier quality

Gemini 3.1 Pro Preview

Highest GPQA route in tracked API data

Use this when benchmark headroom matters more than token price.

$/1M input

$2.00

$12.00 output

Google-Proof Q&A

94.3%

tracked signal

Host
Google AI Studio
Batch price
Not tracked
Research age
Researched 8d ago · 2026-05-11

Price/performance

Gemma 4 E4B IT

Best benchmark-per-input-dollar route

Use this when you need credible quality without letting input cost run away.

$/1M input

$0

$0 output

Google-Proof Q&A / input $

58,600x

tracked signal

Host
GCP Vertex AI
Batch price
Not tracked
Research age
Researched 8d ago · 2026-05-11

OAI-compatible alternative

Kimi K2.6

Top non-OpenAI host with chat-completions evidence

Use this when a base URL swap is attractive, then verify IDs and headers.

$/1M input

$0.950

$4.00 output

Google-Proof Q&A

90.5%

tracked signal

Host
Fireworks AI
Batch price
Not tracked
Research age
Researched 8d ago · 2026-05-11

Browse by intent

Route the next click to the real comparison job.

Integration checklist

Verify the API surface before you ship.

These checks prevent model-level capability from being mistaken for provider-route capability.

Streaming

Route docs

Confirm the provider route supports streaming responses before you design UI latency around it.

Function calling

Open

Prefer models with tool-use or structured-output evidence when the app calls your own functions.

Batch APIs

Open

Use batch-price evidence for offline enrichment, eval generation, and cost-sensitive backfills.

OAI-compatible routing

Route docs

Check model IDs, base URLs, auth headers, and SDK support before treating a provider swap as drop-in.

Data-feed signal

We maintain pricing, benchmarks, and freshness for every model on this site. Want it as a JSON feed in your tooling?

Tell us