LLM Reference

Best LLMs by Use Case

Last refreshed 2026-07-01. Next refresh: weekly.

Ranked shortlists for every job — coding, agents, function calling, long context, vision, RAG, and more — with comparison shortcuts and API pricing for each pick.

Best LLMs for Code Generation

The top coding LLMs in 2026, ranked by SWE-bench and HumanEval. Includes API pricing and context window for each pick — updated daily.

Best LLMs for RAG

Best LLMs for RAG in 2026 — ranked by context window, retrieval benchmarks, and tool support. Covers document QA to enterprise search.

Best AI Agent Models 2026: SWE-bench Ranked

Compare the best LLMs for agentic workflows in 2026, ranked by SWE-bench Verified and τ-bench. See top picks with provider availability, context, and pricing.

Best LLMs for Classification

Best LLMs for text classification, routing, and moderation in 2026. Covers extraction, safety labeling, and structured output tasks.

Best Open Source LLMs

The best open-weight LLMs in 2026, ranked by benchmark scores. Run locally, self-host, or deploy on your own infra — no API key required.

Best Multimodal / Vision LLMs

Best vision and multimodal LLMs in 2026, ranked by image benchmarks. Covers image QA, document understanding, and video analysis.

Best LLM for Translation in 2026

Compare multilingual LLMs and dedicated translation models for text, document, and live speech translation. Ranked by context length and general-language benchmarks until translation-specific leaderboard rows land in seed data.

Best AI Image Models in 2026

Compare image generation and editing models across GPT Image, Gemini Nano Banana, FLUX, Ideogram, and related creative providers. Distinct from /best/vision, which ranks multimodal image-understanding LLMs.

Best AI Video Models in 2026

Compare video generation models across Sora, Veo, Runway, Kling, Luma, and related creative providers. Distinct from /best/vision, which ranks multimodal models that understand video inputs.

Best LLMs for Reasoning & Math

Best reasoning and math LLMs in 2026, ranked by GPQA Diamond and MMLU. Includes thinking-mode models and chain-of-thought specialists.

Best Small Language Models (SLMs)

The best small LLMs under 10B parameters in 2026 — fast, cheap, and deployable on-device or at the edge with strong benchmark scores.

Best LLMs for Function Calling & Tool Use

Best LLMs for function calling and tool use in 2026. Ranked by BFCL benchmark, with native JSON output and structured output support.

Cheapest LLM APIs You Can Call Right Now

The cheapest LLM APIs you can call today, ranked by input price with a quality score beside each so you see the trade-off.

Best Long Context LLMs

LLMs with the largest context windows in 2026 — from 128K to 2M tokens. Ranked by window size with pricing and retrieval accuracy.

Best Mainstream LLM APIs, Ranked

The best LLM APIs for developers in 2026 — ranked by benchmark quality first, then price. Updated daily with live provider pricing.

Best LLMs for Enterprise

Enterprise-grade LLMs available on AWS Bedrock, Azure AI Foundry, and Vertex AI — with SLAs, function calling, and structured outputs.

Best Free LLMs You Can Use Right Now

Free LLMs you can use right now: zero-cost hosted tiers first, then open-weight models you can self-host. Updated as free tiers change.

Best LLMs for Writing

The best LLMs for writing in 2026, ranked by human preference. Covers long-form essays, creative prose, and marketing copy — with pricing.

Best LLMs for Marketing

Top language models for marketing copy, ad creative, email, social posts, and brand-voice content. Ranked by Chatbot Arena human-preference scores with MMLU as a fallback.

Best LLMs for Customer Support

Function-calling models for support bots, ranked by tau-bench service-task performance with BFCL fallback and a $25 per 1k conversation cost gate.

New to LLMs?

Start with these plain-English guides before diving into the ranked shortlists.

Routing layer

Pre-filtered router and gateway views for teams that want a routing layer before, or instead of, a single model pick.

Best LLM gateways

LLM gateways sit between your app and one or more model providers. Use this list when you want a unified endpoint, provider failover, observability, or governance without changing the model decision itself.

Best LLM routers

LLM routers choose the model or tier for each request. Use this page when cost, quality, latency, or reliability depends on routing policy rather than one static model pick.

OpenRouter alternatives

OpenRouter is the flagship hybrid router in this seed, but it is not the only way to unify model access. This view excludes OpenRouter and keeps the alternatives that still give you gateway or hybrid routing behavior.

Self-hosted LLM routers

Self-hosted router and gateway options keep routing policy closer to your infrastructure. Use this list when vendor lock-in, data boundaries, or deployment control matter more than hosted convenience.

Open-source LLM routers

Open-source router and gateway projects are the right first stop when you need inspectable routing policy, self-host paths, or a proxy you can adapt. This page keeps open-source rows in one view across gateway, router, and hybrid types.

Cheapest LLM gateway options

Cost-focused routing can lower spend by sending simple requests to cheaper models or cheaper provider routes. Use this page for gateways and routers whose verified objective includes cost optimization.

LLM cost optimization routers

Cost optimization routers decide when a cheaper model or provider route is good enough. Use this list when you already know the task shape and need a routing layer to keep quality while reducing token spend.