LLM Reference

Kimi K2.5

Released
2026-03-15
Last refreshed
2026-06-04
Status
Researched 1d ago
ProprietaryCommercial use with conditionsMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Kimi K2.5 is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 256k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Workloads where another current model has stronger sourced task evidence
Specifications
Family
Kimi
Released
2026-03-15
Context
256k
Parameters
1T (MoE, 384 experts)
Architecture
Mixture of Experts
Specialization
code
Openness
Proprietary
License
ProprietaryCommercial use with conditions
Training
finetuned
Created by

Lossless long-context AI innovation

Beijing, China
Founded 2023
Website
Pricing
Output / 1M
$2.00
Input / 1M
$0.440

Cheapest of 11 routes · OpenRouter

About

Kimi K2.5 is Moonshot AI's Kimi model focused on code generation and software engineering. It offers a 256K-token context window and scores 87.9 on GPQA.

Kimi K2.5 is a proprietary model in the Kimi family. The structured metadata tracks a 256k-token context window, multimodal input, function calling, and structured outputs. This page tracks provider routes through Cloudflare Workers AI, Fireworks AI, OpenRouter, and 8 more, with the cheapest tracked route listed at $0.44 input and $2 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 87.9, MMLU PRO 87.1, and BFCL 47.1.

Top use-case fit: coding, agents, and build tasks

Coding

Included by capability and metadata signals in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ B

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 11

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
OpenRouter$0.440$2.00
Serverless
Together AI$0.500$2.80
Serverless
AWS Bedrock$0.600$3.00
Serverless
Fireworks AI$0.600$3.00
Serverless

Capabilities

VisionMultimodalFunction CallingStructured Outputs

Benchmark peer barsfor Agents

Benchmark scores(6)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Google-Proof Q&A87.9diamondhttps://artificialanalysis.ai/leaderboards/models
MMLU PRO87.1https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro
BFCL47.1v4https://gorilla.cs.berkeley.edu/leaderboard.html
τ-bench74.2τ-benchhttps://taubench.com/
MultiChallenge61.4MultiChallengehttps://labs.scale.com/leaderboard/multichallenge
SWE-rebench58.5pass@1 (best of 5 runs)https://swe-rebench.com/leaderboard

Migration checks

No linked migration route is available for this model yet.

Show all 65 popular comparisonssorted by 7-day search impressions
Kimi K2.5 vs GPT-5.5162Kimi K2.5 vs DeepSeek V3.1132Kimi K2.5 vs DeepSeek R1130Kimi K2.5 vs Xiaomi MiMo-V2.5128Kimi K2.5 vs Qwen3.5-397B-A17B123Kimi K2.5 vs Claude Opus 4.5105Kimi K2.5 vs Qwen3-235B-A22B105Kimi K2.5 vs GLM-5V-Turbo103Kimi K2.5 vs Gemini 3 Pro90Kimi K2.5 vs Xiaomi MiMo-V2.5-TTS-Series85Kimi K2.5 vs Qwen3.6-35B-A3B79Kimi K2.5 vs Qwen2.5-72B-Instruct73Kimi K2.5 vs Together AI Qwen2-72B-Instruct73Kimi K2.5 vs Claude Opus 4.672Kimi K2.5 vs Ling-2.6-Flash66Kimi K2.5 vs Qwen3.6-27B62Kimi K2.5 vs Grok-358Kimi K2.5 vs GLM-5 9B53Kimi K2.5 vs DeepSeek R1 Distill Llama 70B49Kimi K2.5 vs Mistral Large 3 675B Instruct47Kimi K2.5 vs GLM-5 Turbo46Kimi K2.5 vs DeepSeek R1 052846Kimi K2.5 vs Llama 3.1 70B Instruct45Kimi K2.5 vs Gemini 2.5 Flash Live API43Kimi K2.5 vs o338Kimi K2.5 vs Trinity-Large-Thinking38Kimi K2.5 vs Claude 3.7 Sonnet36Kimi K2.5 vs Together AI Qwen2-7B-Instruct33Kimi K2.5 vs Qwen3.5-35B-A3B32Kimi K2.5 vs GPT-5.227Kimi K2.5 vs Llama 3.1 405B Instruct27Kimi K2.5 vs Qwen2.5-7B-Instruct26Kimi K2.5 vs Llama 3 70B Instruct26Kimi K2.5 vs Gemini 2.5 Pro Computer Use Preview26Kimi K2.5 vs Llama 2 13B Chat24Kimi K2.5 vs Phi-3 Mini 4k23Kimi K2.5 vs Mistral Large 222Kimi K2.5 vs Tencent Hunyuan Turbo S21Kimi K2.5 vs Gemini 2.5 Flash20Kimi K2.5 vs Qwen3-9B20Kimi K2.5 vs Qwen3.5-27B19Kimi K2.5 vs Mistral Nemotron19Kimi K2.5 vs Gemma 7B Instruct17Kimi K2.5 vs o3 Mini17Kimi K2.5 vs DeepSeek V3.215Kimi K2.5 vs GPT-5.4 Pro14Kimi K2.5 vs Llama 3 8B Instruct11Kimi K2.5 vs Llama 3.2 1B11Kimi K2.5 vs Llama 2 70B Chat10Kimi K2.5 vs GPT-5.4-Cyber9Kimi K2.5 vs o3 Deep Research9Kimi K2.5 vs Qwen2.5-72B8Kimi K2.5 vs Together AI - Llama 3 8B Lite7Kimi K2.5 vs GPT-5.46Kimi K2.5 vs Mixtral 8x7B5Kimi K2.5 vs Qwen2-7B-Instruct4Kimi K2.5 vs Qwen3.5-122B-A10B4Kimi K2.5 vs Mixtral 8x22B Instruct v0.34Kimi K2.5 vs StepFun Step-24Kimi K2.5 vs Llama 3.2 1B Instruct3Kimi K2.5 vs Qwen3.5-9B3Kimi K2.5 vs Gemini 2.5 Pro Preview 05-062Kimi K2.5 vs Phi-4 Mini Flash Reasoning2Kimi K2.5 vs Qwen2.5-Max1Kimi K2.5 vs Qwen3-Max0