Best LLMs by Use Case

Last refreshed 2026-07-01. Next refresh: weekly.

Ranked shortlists for every job — coding, agents, function calling, long context, vision, RAG, and more — with comparison shortcuts and API pricing for each pick.

Best LLMs for Code Generation

The top coding LLMs in 2026, ranked by SWE-bench and HumanEval. Includes API pricing and context window for each pick — updated daily.

Best LLMs for RAG

Best LLMs for RAG in 2026 — ranked by context window, retrieval benchmarks, and tool support. Covers document QA to enterprise search.

Best AI Agent Models 2026: SWE-bench Ranked

Compare the best LLMs for agentic workflows in 2026, ranked by SWE-bench Verified and τ-bench. See top picks with provider availability, context, and pricing.

Best LLMs for Classification

Best LLMs for text classification, routing, and moderation in 2026. Covers extraction, safety labeling, and structured output tasks.

Best Open Source LLMs

The best open-weight LLMs in 2026, ranked by benchmark scores. Run locally, self-host, or deploy on your own infra — no API key required.

Best Multimodal / Vision LLMs

Best vision and multimodal LLMs in 2026, ranked by image benchmarks. Covers image QA, document understanding, and video analysis.

Best LLM for Translation in 2026

Compare multilingual LLMs and dedicated translation models for text, document, and live speech translation. Ranked by context length and general-language benchmarks until translation-specific leaderboard rows land in seed data.

Best AI Image Models in 2026

Compare image generation and editing models across GPT Image, Gemini Nano Banana, FLUX, Ideogram, and related creative providers. Distinct from /best/vision, which ranks multimodal image-understanding LLMs.

Best AI Video Models in 2026

Compare video generation models across Sora, Veo, Runway, Kling, Luma, and related creative providers. Distinct from /best/vision, which ranks multimodal models that understand video inputs.

Best LLMs for Reasoning & Math

Best reasoning and math LLMs in 2026, ranked by GPQA Diamond and MMLU. Includes thinking-mode models and chain-of-thought specialists.

Best Small Language Models (SLMs)

The best small LLMs under 10B parameters in 2026 — fast, cheap, and deployable on-device or at the edge with strong benchmark scores.

Best LLMs for Function Calling & Tool Use

Best LLMs for function calling and tool use in 2026. Ranked by BFCL benchmark, with native JSON output and structured output support.

Cheapest LLM APIs You Can Call Right Now

The cheapest LLM APIs you can call today, ranked by input price with a quality score beside each so you see the trade-off.

Best Long Context LLMs

LLMs with the largest context windows in 2026 — from 128K to 2M tokens. Ranked by window size with pricing and retrieval accuracy.

Best Mainstream LLM APIs, Ranked

The best LLM APIs for developers in 2026 — ranked by benchmark quality first, then price. Updated daily with live provider pricing.

Best LLMs for Enterprise

Enterprise-grade LLMs available on AWS Bedrock, Azure AI Foundry, and Vertex AI — with SLAs, function calling, and structured outputs.

Best Free LLMs You Can Use Right Now

Free LLMs you can use right now: zero-cost hosted tiers first, then open-weight models you can self-host. Updated as free tiers change.

Best LLMs for Writing

The best LLMs for writing in 2026, ranked by human preference. Covers long-form essays, creative prose, and marketing copy — with pricing.

Best LLMs for Marketing

Top language models for marketing copy, ad creative, email, social posts, and brand-voice content. Ranked by Chatbot Arena human-preference scores with MMLU as a fallback.

Best LLMs for Customer Support

Function-calling models for support bots, ranked by tau-bench service-task performance with BFCL fallback and a $25 per 1k conversation cost gate.

New to LLMs?

Start with these plain-English guides before diving into the ranked shortlists.

What Is an LLM?

How language models work, in plain English

How to Choose an LLM

A 5-step framework: task, budget, context, provider, openness

LLM API Pricing Explained

Tokens, input vs output cost, batch pricing, cost estimator

Routing layer

Pre-filtered router and gateway views for teams that want a routing layer before, or instead of, a single model pick.