LLM Reference

Kimi K2 Models by Moonshot AI

Moonshot AIMITOpen source
11 models2025–2026Up to 262k ctxFrom $0.5/1M input

Details

ResearcherMoonshot AI
LicenseMITOSI-approved
Commercial useCommercial use: permitted
Models11
Released2025–2026
Max context262k

Capabilities

Vision3 of 11 models
Multimodal3 of 11 models
Reasoning5 of 11 models
Function Calling7 of 11 models
Tool Use3 of 11 models
Structured Outputs6 of 11 models

Links

Website

About

Kimi K2 is Moonshot AI's frontier mixture-of-experts family for agentic coding and long-context reasoning. The line spans 8 models, from the original Kimi K2 release through Kimi K2 Thinking and the current flagship Kimi K2.6, with 1T total parameters and 32B active per forward pass. Kimi K2.6 scores 1454 on Chatbot Arena, 92 on HumanEval, 89.6 on LiveCodeBench, and 80.2 on SWE-bench Verified, putting it near Claude Opus 4.7 and DeepSeek V4 Pro for coding tasks at lower listed API prices.

Compare Against Top Competitors

Frontier coding comparison set for teams deciding between Kimi K2.6, Claude, DeepSeek V4, and GPT-5-class models.Scores come from existing benchmark seed data; "-" means this site has no local score for that benchmark yet.

Kimi K2 flagship benchmark comparison against top competitor models
ModelContextInput / 1MChatbot ArenaSWE-bench VerifiedLiveCodeBenchHumanEval
Kimi K2.6family pick262k$0.73/1M1,46280.289.692
Claude Opus 4.71m-1,50387.6--
DeepSeek V4 Pro1m-1,45680.693.576.8
GPT-5.51.05m-1,48882.6-94.2
Claude Sonnet 4.61m-1,45979.68098

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

10 in view1 retired

Use when the workload needs code, 262k context, and 1000B parameters.

2026-06code262k context1000B parameters

Use when the workload needs code, 262k context, and 1000B parameters.

2026-06code262k context1000B parameters
Kimi K2.6Current

Use when the workload needs code, 262k context, and 1000B parameters.

2026-04code262k context1000B parameters

Use when the workload needs 262k context.

2025-11262k context

Use when the workload needs 131k context, reasoning, and structured outputs.

2025-09131k contextreasoningstructured outputs

Use when the workload needs 131k context.

2025-09131k context

Use when the workload needs 262k context, 1K parameters, and function calling.

2025-09262k context1K parametersfunction calling

Use when the workload needs 262k context, 1K parameters, and function calling.

2025-08262k context1K parametersfunction calling

Use when the workload needs 131k context, 1K parameters, and function calling.

2025-07131k context1K parametersfunction calling

Use when the workload needs reasoning, 256k context, and structured outputs.

2025-01reasoning256k contextstructured outputs

Release Timeline

7 release groups
2026-06
2 current
Kimi K2.7-Code
code262k context1000B parameters
Current
Kimi K2.7-Code HighSpeed
code262k context1000B parameters
Current
2026-04
1 current
Kimi K2.6
code262k context1000B parameters
Current
2025-11
1 current
Current
2025-09
3 current
Kimi K2 0905 Preview
262k context1K parametersfunction calling
Current
Kimi K2 Instruct
131k contextreasoningstructured outputs
Current
Current
2025-08
1 current
Kimi K2 Turbo Preview
262k context1K parametersfunction calling
Current
2025-07
1 current · 1 retired
Kimi K2
262k context1K parametersfunction calling
Replaced
Kimi K2 0711 Preview
131k context1K parametersfunction calling
Current
2025-01
1 current
Kimi K2 Thinking
reasoning256k contextstructured outputs
Current

Replaced By

Keep for legacy integrations; evaluate Kimi K2.6 before new work.

Specifications(11 models)

Kimi K2 model specifications comparison
ModelReleasedContextParametersVisionMultimodalReasoningFn CallingTool UseStructured Outputs
Kimi K2.7-Code HighSpeed2026-06262k1TYesYesYesYesYesYes
Kimi K2.7-Code2026-06262k1TYesYesYesYesYesYes
Kimi K2.62026-04262k1TYesYesYesYesYesYes
Kimi K2 Thinking Turbo2025-11262k1T (32B active)NoNoNoNoNoNo
Kimi K2 Instruct2025-09131k1T total, 32B active (MoE)NoNoYesNoNoYes
Kimi K2 Instruct 09052025-09131k1T total, 32B active (MoE)NoNoNoNoNoNo
Kimi K2 0905 Preview2025-09262k1KNoNoNoYesNoNo
Kimi K2 Turbo Preview2025-08262k1KNoNoNoYesNoNo
Kimi K2 0711 Preview2025-07131k1KNoNoNoYesNoNo
Kimi K2 Thinking2025-01256k1T (32B active)NoNoYesNoNoYes

Pricing

Kimi K2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Kimi K2 InstructVercel AI Gateway$0.57$2.3Serverless
Kimi K2 InstructNovita AI$0.57$2.3Serverless
Kimi K2 InstructFireworks AI$0.6$2.5Serverless
Kimi K2 Instruct 0905Fireworks AI$0.6$2.5Serverless
Kimi K2 ThinkingFireworks AI$0.6$2.5Serverless
Kimi K2 ThinkingGCP Vertex AI$0.6$2.5Serverless
Kimi K2 ThinkingAWS Bedrock$0.6$2.5Serverless
Kimi K2 ThinkingOpenRouter$0.6$2.5Serverless
Kimi K2 ThinkingVercel AI Gateway$0.6$2.5Serverless
Kimi K2 ThinkingNovita AI$0.6$2.5Serverless
Kimi K2 0905 PreviewNovita AI$0.6$2.5Serverless
Kimi K2.7-CodeOpenRouter$0.612$3.069Serverless
Kimi K2.6OpenRouter$0.73$3.49Serverless
Kimi K2.6Novita AI$0.8$3.4Serverless
Kimi K2.6Cloudflare Workers AI$0.95$4Serverless
Kimi K2.6Moonshot AI Kimi$0.95$4Serverless
Kimi K2.6Fireworks AI$0.95$4Serverless
Kimi K2.6Vercel AI Gateway$0.95$4Serverless
Kimi K2.7-CodeMoonshot AI Kimi$0.95$4Serverless
Kimi K2 Thinking TurboVercel AI Gateway$1.15$8Serverless
Kimi K2 InstructTogether AI$1.2$4.5Serverless
Kimi K2.6Together AI$1.2$4.5Serverless
Kimi K2.7-Code HighSpeedMoonshot AI Kimi$1.9$8Serverless

Popular comparisons in this family

Frequently Asked Questions

What is Kimi K2 used for?
Kimi K2 is used for code, reasoning, and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Kimi K2 compare to Moonshot?
Kimi K2 by Moonshot AI is strongest where you need code, while Moonshot by Moonshot AI is the closest related family to check for adjacent model selection. Kimi K2 has 11 listed variants and reaches up to 262k context, while Moonshot reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which Kimi K2 model should I use?
For the lowest listed input price, start with Kimi K2 through AWS Bedrock at $0.5/1M input tokens. For the most capable/latest local choice, evaluate Kimi K2.7-Code HighSpeed with 262k context and reasoning, tool use, function calling, structured outputs, and multimodal inputs.