Kimi K2.6

Name: Kimi K2.6
Author: Moonshot AI

Released

2026-04-20

Last refreshed

2026-06-30

Status

Researched 18d ago

Open sourceCommercial use: permittedMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Kimi K2.6 is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 262k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Kimi K2
Released: 2026-04-20
Context: 262k
Max output: 65,536
Parameters: 1T
Architecture: Mixture of Experts
Knowledge cutoff: 2025-04
Specialization: code
Openness: Open source
License: MITOSI-approvedCommercial use: permitted
Weights: Available·Moonshot Modified MIT License
Code: Available·Moonshot Modified MIT License

Created by

Moonshot AI

Lossless long-context AI innovation

Beijing, China

Founded 2023

Website

Pricing

Output / 1M

$3.40

Input / 1M

$0.800

Cheapest of 9 routes · Novita AI

Providers(9)

Cloudflare Workers AI NVIDIA NIM Moonshot AI Kimi Fireworks AI OpenRouter Microsoft Foundry Vercel AI Gateway Novita AI Together AI

View 9 provider routes

Links

Website HuggingFace

About

Kimi K2.6 is Moonshot AI's multimodal agentic coding model, released April 20 2026 under a Modified MIT license. Built on a 1-trillion-parameter MoE architecture (32B active, 384 experts with 8 selected per token plus 1 shared expert, 61 layers), it features a 262K context window and up to 65,536 output tokens. Supports native image and video inputs (screenshots, PDFs, spreadsheets). Designed for long-horizon coding with agent swarms of up to 300 sub-agents and 4,000 coordinated steps; Moonshot AI cites 200–300 sequential tool calls without task drift. Key benchmarks: SWE-bench Verified 80.2%, SWE-bench Pro 58.6%, LiveCodeBench v6 89.6%, GPQA Diamond 90.5%, Terminal-Bench 2.0 66.7%. Chatbot Arena Elo 1454 (2026-04-28 snapshot).

Kimi K2.6 is an open-source model in the Kimi K2 family. The structured metadata tracks a 262k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Cloudflare Workers AI, NVIDIA NIM, Moonshot AI Kimi, and 6 more, with the cheapest tracked route listed at $0.73 input and $3.49 output per 1M tokens. Headline tracked benchmarks include SWE-bench Verified 80.2, Chatbot Arena 1462.0, and Google-Proof Q&A 90.5.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

4 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ C

1 relevant benchmark in the decision map.

Provider price ladder

Compare all 9

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Novita AI	$0.800	$3.40	Serverless
OpenRouter	$0.730	$3.49	Serverless
Cloudflare Workers AI	$0.950	$4.00	Serverless
Fireworks AI	$0.950	$4.00	Serverless

Available via routers & gateways(7)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSMicrosoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughTogether AIFireworks AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionMicrosoft Foundry

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

PassthroughMicrosoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionMicrosoft Foundry

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

SubscriptionMicrosoft Foundry

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsPrompt Caching

Benchmark peer barsfor Coding

SWE-bench ProRank 12 of 41

80.3

69.2

64.7

64.6

58.6

SWE-bench VerifiedRank 15 of 80

Claude Fable 5

96.0

Claude Mythos Preview

93.9

Claude Opus 4.8

88.6

Claude Opus 4.7

87.6

Kimi K2.6current

80.2

HumanEvalRank 12 of 97

Claude Sonnet 4.6

98.0

96.7

Claude Opus 4.6

95.0

Grok-3

94.5

Kimi K2.6current

92.0

Benchmark scores(17)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Evaluation	Source
SWE-bench Verified	80.2	SWE-bench VerifiedObserved 2026-04-24	—	Source
Chatbot Arena	1462.0	—Observed 2026-05-17	—	Source
Google-Proof Q&A	90.5	GPQA Diamond (accuracy)Observed 2026-06-07	—	Source
MMLU PRO	84.6	From Kimi K2 (accuracy)Observed 2026-06-07	—	Source
HumanEval	92.0	—Observed 2026-04-29	—	Source
SWE-bench Pro	58.6	SWE-bench Pro (resolved%)Observed 2026-06-07	—	Source
Instruction-Following Evaluation	89.8	—Observed 2026-04-29	—	Source
MMMU Pro	80.1	LLM-Stats aggregatorObserved 2026-06-07	—	Source
LiveCodeBench	89.6	v6Observed 2026-04-20	—	Source
Terminal-Bench 2.0	66.7	Terminal-Bench 2.0 (accuracy%)Observed 2026-06-07	—	Source
SWE-bench Multilingual	76.7	—Observed 2026-04-20	—	Source
AIME 2026	96.4	AIME 2026 (accuracy)Observed 2026-06-07	—	Source
BrowseComp	83.2	BrowseComp (accuracy%)Observed 2026-06-07	—	Source
Humanity's Last Exam	34.7	HLE-Full without tools (accuracy)Observed 2026-06-07	—	Source
MCP-Atlas	55.9	MCP-Atlas (accuracy%)Observed 2026-06-07	—	Source
CursorBench	47.6	CursorBench 3.1Observed 2026-06-30	Configuration: Kimi 2.6 (single reported configuration) Harness: CursorBench 3.1 Evaluator: Cursor Confidence: confirmed Notes: Cursor published one CursorBench 3.1 configuration for this model; no cross-effort selection was needed.	Source
GeneBench-Pro	4.4	xhighObserved 2026-06-30	—	Source

Compare Kimi K2.6 to GPT-5.5 head-to-head for benchmarks, pricing, and full specs side by side. On NVIDIA NIM, see pricing and specs or the setup guide (catalog ID moonshotai/kimi-k2.6).

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)

Best for Open weightsStrong Best LLMs for Code GenerationListed Best AI Agent Models 2026: SWE-bench RankedListed Best Open Source LLMsListed Best LLMs for Reasoning & MathListed Best Mainstream LLM APIs, RankedListed

Compare Kimi K2.6 with other models

Comparison and alternatives

Browse all comparisons →

Kimi K2.6 vs GLM-5.2 Kimi K2.6 vs Claude Sonnet 5 Kimi K2.6 vs DeepSeek V4 Flash Kimi K2.6 vs Claude Opus 4.8 Kimi K2.6 vs Kimi K2.5 Kimi K2.6 vs Claude Opus 4.7 Kimi K2.6 vs DeepSeek V4 Pro Kimi K2.6 vs GPT-5.2 Codex Kimi K2.6 vs GPT-5.1 Codex Max Kimi K2.6 vs Gemini 3.1 Pro Preview Kimi K2.6 vs GPT-5.5 Kimi K2.6 vs GPT-5.5 Pro Kimi K2.6 vs Grok 4 Kimi K2.6 vs Claude Sonnet 4.6 Kimi K2.6 vs Composer 2.5 Kimi K2.6 vs Step 3.7 Flash

Show all 32 popular comparisonssorted by 7-day search impressions

Kimi K2.6 vs DeepSeek R12K Kimi K2.6 vs Grok 4.32K Kimi K2.6 vs DeepSeek V32K Kimi K2.6 vs GPT-5.41K Kimi K2.6 vs DeepSeek V3.11K Kimi K2.6 vs Claude Sonnet 4.5816 Kimi K2.6 vs Grok-3663 Kimi K2.6 vs GPT-5.5 Instant538 Kimi K2.6 vs Claude Opus 4.5314 Kimi K2.6 vs Grok 4.20289 Kimi K2.6 vs o3 Deep Research221 Kimi K2.6 vs o3219 Kimi K2.6 vs Claude Opus 4.6183 Kimi K2.6 vs o3-pro158 Kimi K2.6 vs Gemini 3.1 Pro Preview Custom Tools95 Kimi K2.6 vs Llama 3 8B Instruct60 Kimi K2.6 vs Mistral Nemotron45 Kimi K2.6 vs Gemini 3.1 Flash-Lite39 Kimi K2.6 vs GPT-5.1 Codex36 Kimi K2.6 vs Phi-4 Reasoning Vision 15B32 Kimi K2.6 vs Claude Mythos Preview31 Kimi K2.6 vs Llama Guard 3 1B28 Kimi K2.6 vs GPT-5.123 Kimi K2.6 vs Llama Guard 4 12B18 Kimi K2.6 vs GPT-5.3-Codex-Spark17 Kimi K2.6 vs Mistral Magistral Small 250913 Kimi K2.6 vs Phi-4 Mini Flash Reasoning10 Kimi K2.6 vs Composer 27 Kimi K2.6 vs Magistral Small 25064 Kimi K2.6 vs Claude Haiku 4.53 Kimi K2.6 vs GPT-5.4-Cyber1 Kimi K2.6 vs GPT-5.4 Pro1

Frequently asked questions

What is the context window of Kimi K2.6?

Kimi K2.6 has a context window of 262k tokens.

What is the max output of Kimi K2.6?

Kimi K2.6 can generate up to 65,536 output tokens.

How much does Kimi K2.6 cost?

Kimi K2.6 pricing ranges from $0.73/1M to $1.20/1M input tokens depending on the provider.

When was Kimi K2.6 released?

Kimi K2.6 was released on 2026-04-20.

Which providers offer Kimi K2.6?

Kimi K2.6 is available from 9 providers: Cloudflare Workers AI, NVIDIA NIM, Moonshot AI Kimi, Fireworks AI, OpenRouter, Microsoft Foundry, Vercel AI Gateway, Novita AI, Together AI.

What benchmarks has Kimi K2.6 been tested on?

Kimi K2.6 has been evaluated on 17 benchmarks, including SWE-bench Verified, Chatbot Arena, Google-Proof Q&A, MMLU PRO, HumanEval.

Created by

Moonshot AI

Lossless long-context AI innovation

Beijing, China

Founded 2023

Website

Pricing

Output / 1M

$3.40

Input / 1M