Compare AI models
Start with two models, inspect the tradeoff, then open a verdict-first detail page with pricing, benchmark, capability, and provider evidence.
Decision builder
Pick the pair before opening the detail page
Claude Opus 4.7 vs Kimi K2.6
Kimi K2.6 is ~567% cheaper at $0.75/1M; pay for Claude Opus 4.7 only for coding workflow support.
- Output price
- $25.00 / $3.50
- Context
- 1M / 262K
- Benchmarks
- 4 shared
- Providers
- 6 / 5
Popular pairs
Browse comparisons with a decision signal attached
DeepSeek V4 Pro vs GLM-5.1
DeepSeek V4 Pro is ~141% cheaper at $0.43/1M; pay for GLM-5.1 only for coding workflow support.
- Output price
- $0.870 / $3.50
- Context
- 1M / 200k
- Benchmarks
- 2 shared
- Providers
- 3 / 3
DeepSeek V4 Pro vs Kimi K2.6
DeepSeek V4 Pro is ~72% cheaper at $0.43/1M; pay for Kimi K2.6 only for coding workflow support.
- Output price
- $0.870 / $3.50
- Context
- 1M / 262K
- Benchmarks
- 7 shared
- Providers
- 3 / 5
Claude Sonnet 4.6 vs DeepSeek V4 Flash
DeepSeek V4 Flash is ~2043% cheaper at $0.14/1M; pay for Claude Sonnet 4.6 only for coding workflow support.
- Output price
- $15.00 / $0.280
- Context
- 1M / 1M
- Benchmarks
- 3 shared
- Providers
- 5 / 3
Gemini 2.5 Pro vs Grok 4
Grok 4 is safer overall; choose Gemini 2.5 Pro when coding workflow support matters.
- Output price
- $10.00 / $2.50
- Context
- 1M / 256k
- Benchmarks
- 2 shared
- Providers
- 3 / 4
DeepSeek V4 Flash vs GLM-5.1
DeepSeek V4 Flash is ~650% cheaper at $0.14/1M; pay for GLM-5.1 only for coding workflow support.
- Output price
- $0.280 / $3.50
- Context
- 1M / 200k
- Benchmarks
- 2 shared
- Providers
- 3 / 3
Claude Sonnet 4.6 vs Kimi K2.6
Kimi K2.6 is ~300% cheaper at $0.75/1M; pay for Claude Sonnet 4.6 only for coding workflow support.
- Output price
- $15.00 / $3.50
- Context
- 1M / 262K
- Benchmarks
- 4 shared
- Providers
- 5 / 5
DeepSeek V4 Flash vs Grok 4
DeepSeek V4 Flash is ~793% cheaper at $0.14/1M; pay for Grok 4 only for coding workflow support.
- Output price
- $0.280 / $2.50
- Context
- 1M / 256k
- Benchmarks
- 2 shared
- Providers
- 3 / 4
Qwen3.6-27B vs Qwen3.6-35B-A3B
Qwen3.6-35B-A3B is ~113% cheaper at $0.15/1M; pay for Qwen3.6-27B only for coding workflow support.
- Output price
- $3.20 / $1.00
- Context
- 262K / 262K
- Benchmarks
- 3 shared
- Providers
- 2 / 1
GLM-5 vs GLM-5.1
GLM-5 is ~75% cheaper at $0.6/1M; pay for GLM-5.1 only for coding workflow support.
- Output price
- $2.08 / $3.50
- Context
- 200k / 200k
- Benchmarks
- 1 shared
- Providers
- 5 / 3
Claude Opus 4.7 vs Kimi K2.6
Kimi K2.6 is ~567% cheaper at $0.75/1M; pay for Claude Opus 4.7 only for coding workflow support.
- Output price
- $25.00 / $3.50
- Context
- 1M / 262K
- Benchmarks
- 4 shared
- Providers
- 6 / 5
Llama 3 70B Instruct vs Llama 3.1 70B Instruct
Pick Llama 3.1 70B Instruct for coding; Llama 3 70B Instruct is better when provider fit matters more.
- Output price
- $0.400 / $0.400
- Context
- 8K / 128K
- Benchmarks
- 2 shared
- Providers
- 17 / 11
DeepSeek V4 Flash vs Kimi K2.6
DeepSeek V4 Flash is ~436% cheaper at $0.14/1M; pay for Kimi K2.6 only for coding workflow support.
- Output price
- $0.280 / $3.50
- Context
- 1M / 262K
- Benchmarks
- 5 shared
- Providers
- 3 / 5
DeepSeek V4 Flash vs Qwen3.6-27B
DeepSeek V4 Flash is ~129% cheaper at $0.14/1M; pay for Qwen3.6-27B only for coding workflow support.
- Output price
- $0.280 / $3.20
- Context
- 1M / 262K
- Benchmarks
- 3 shared
- Providers
- 3 / 2
Claude Sonnet 4.6 vs GPT-5.5 Pro
Claude Sonnet 4.6 is ~900% cheaper at $3/1M; pay for GPT-5.5 Pro only for coding workflow support.
- Output price
- $15.00 / $180.00
- Context
- 1M / 1.1M
- Benchmarks
- 2 shared
- Providers
- 5 / 2
Gemini 2.5 Flash vs Grok 4
Gemini 2.5 Flash is ~317% cheaper at $0.3/1M; pay for Grok 4 only for coding workflow support.
- Output price
- $2.50 / $2.50
- Context
- 1M / 256k
- Benchmarks
- 2 shared
- Providers
- 4 / 4
Gemini 2.5 Pro vs o3
Gemini 2.5 Pro is ~60% cheaper at $1.25/1M; pay for o3 only for coding workflow support.
- Output price
- $10.00 / $8.00
- Context
- 1M / 200K
- Benchmarks
- 5 shared
- Providers
- 3 / 2
DeepSeek V3.1 vs DeepSeek V4 Pro
DeepSeek V4 Pro fits 16x more tokens; pick it for long-context work and DeepSeek V3.1 for tighter calls.
- Output price
- $1.68 / $0.870
- Context
- 64K / 1M
- Benchmarks
- 2 shared
- Providers
- 6 / 3
Grok-3 vs Grok 4
Grok-3 is ~56% cheaper at $0.8/1M; pay for Grok 4 only for coding workflow support.
- Output price
- $2.40 / $2.50
- Context
- 1M / 256k
- Benchmarks
- 2 shared
- Providers
- 4 / 4