LLM ReferenceLLM Reference

Kimi K2 Models by Moonshot AI

8 models2025–2026Up to 262K ctxFrom $0.5/1M input

About

Kimi K2 is Moonshot AI's frontier MoE model family for agentic coding and long-context reasoning. The line spans the original K2 release and Kimi K2.6, with 1T total parameters, 32B active parameters, a 262K-token context window, and support for coding, tool use, and vision-assisted workflows.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view
Kimi K2.6Current

Use when the workload needs code, 262K context, and 1000B parameters.

2026-04code262K context1000B parameters

Use when the workload needs 262K context.

2025-11262K context

Use when the workload needs 262K context, 1K parameters, and function calling.

2025-09262K context1K parametersfunction calling

Use when the workload needs 262K context, 1K parameters, and function calling.

2025-08262K context1K parametersfunction calling
Kimi K2Current

Use when the workload needs 262K context, 1K parameters, and function calling.

2025-07262K context1K parametersfunction calling

Use when the workload needs 131K context, 1K parameters, and function calling.

2025-07131K context1K parametersfunction calling

Use when the workload needs 256K context.

2025-01256K context

Use when the workload needs reasoning, 256K context, and structured outputs.

2025-01reasoning256K contextstructured outputs

Release Timeline

6 release groups
2026-04
1 current
Kimi K2.6
code262K context1000B parameters
Current
2025-11
1 current
Current
2025-09
1 current
Kimi K2 0905 Preview
262K context1K parametersfunction calling
Current
2025-08
1 current
Kimi K2 Turbo Preview
262K context1K parametersfunction calling
Current
2025-07
2 current
Kimi K2
262K context1K parametersfunction calling
Current
Kimi K2 0711 Preview
131K context1K parametersfunction calling
Current
2025-01
2 current
Current
Kimi K2 Thinking
reasoning256K contextstructured outputs
Current

Specifications(8 models)

Kimi K2 model specifications comparison
ModelReleasedContextParametersVisionMultimodalReasoningFn CallingTool UseStructured Outputs
Kimi K2.62026-04262K1TYesYesYesYesYesNo
Kimi K2 Thinking Turbo2025-11262KNoNoNoNoNoNo
Kimi K2 0905 Preview2025-09262K1KNoNoNoYesNoNo
Kimi K2 Turbo Preview2025-08262K1KNoNoNoYesNoNo
Kimi K22025-07262K1KNoNoNoYesNoYes
Kimi K2 0711 Preview2025-07131K1KNoNoNoYesNoNo
Kimi K2 Instruct 09052025-01256KNoNoNoNoNoNo
Kimi K2 Thinking2025-01256KNoNoYesNoNoYes

Available From(7 providers)

Pricing

Kimi K2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Kimi K2AWS Bedrock$0.5$2Serverless
Kimi K2GCP Vertex AI$0.5$2Serverless
Kimi K2OpenRouter$0.57$2.3Serverless
Kimi K2 Instruct 0905Fireworks AI$0.6$2.5Serverless
Kimi K2 ThinkingFireworks AI$0.6$2.5Serverless
Kimi K2 ThinkingGCP Vertex AI$0.6$2.5Serverless
Kimi K2 ThinkingAWS Bedrock$0.6$2.5Serverless
Kimi K2 ThinkingOpenRouter$0.6$2.5Serverless
Kimi K2.6OpenRouter$0.75$3.5Serverless
Kimi K2.6Moonshot AI Kimi$0.9$3.72Serverless
Kimi K2.6Fireworks AI$0.95$4Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is Kimi K2 used for?
Kimi K2 is used for code, reasoning, and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Kimi K2 compare to Moonshot?
Kimi K2 by Moonshot AI is strongest where you need code, while Moonshot by Moonshot AI is the closest related family to check for adjacent model selection. Kimi K2 has 8 listed variants and reaches up to 262K context, while Moonshot reaches up to 128K context, so compare the specs and pricing tables before choosing a production model.
Which Kimi K2 model should I use?
For the lowest listed input price, start with Kimi K2 through AWS Bedrock at $0.5/1M input tokens. For the most capable/latest local choice, evaluate Kimi K2.6 with 262K context and reasoning, tool use, function calling, and multimodal inputs.

Models(8)