LLM Reference

Kimi K2.7-Code HighSpeed

Released
2026-06-15
Last refreshed
2026-06-20
Status
Researched today
Open sourceCommercial use: permittedMultimodalCodingRAGAgentsLong contextVisionJSON / Tool use

Kimi K2.7-Code HighSpeed is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 262k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Workloads where another current model has stronger sourced task evidence
Specifications
Family
Kimi K2
Released
2026-06-15
Context
262k
Max output
65,536
Parameters
1T
Architecture
Mixture of Experts
Specialization
code
Openness
Open source
License
MITOSI-approvedCommercial use: permitted
Created by

Lossless long-context AI innovation

Beijing, China
Founded 2023
Website
Pricing
Output / 1M
-
Input / 1M
-

Cheapest of 1 route · Moonshot AI Kimi

About

HighSpeed serving variant of Kimi K2.7-Code optimized for throughput at the cost of latency flexibility. Announced June 15, 2026 — three days after the standard K2.7-Code release. Delivers approximately 180 output tokens per second (up to 260 tokens/s on short-context tasks), around 6× faster than standard K2.7-Code. Same underlying 1T-parameter MoE architecture (32B active, 384 experts, 8 selected per token) with MoonViT vision encoder, 262K context window, and thinking mode always on. Best suited for interactive or latency-bound workflows; the standard variant is preferred for correctness-sensitive long-horizon agentic work.

Kimi K2.7-Code HighSpeed is an open-source model in the Kimi K2 family. The structured metadata tracks a 262k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Moonshot AI Kimi. No headline benchmark score is tracked for Kimi K2.7-Code HighSpeed yet.

Top use-case fit: coding, agents, and build tasks

Coding

Included by capability and metadata signals in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Moonshot AI Kimi--
ServerlessPartial

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsPrompt Caching

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.