What is the context window of Kimi K2.7-Code HighSpeed?

Kimi K2.7-Code HighSpeed has a context window of 262k tokens.

What is the max output of Kimi K2.7-Code HighSpeed?

Kimi K2.7-Code HighSpeed can generate up to 65,536 output tokens.

When was Kimi K2.7-Code HighSpeed released?

Kimi K2.7-Code HighSpeed was released on 2026-06-15.

Which providers offer Kimi K2.7-Code HighSpeed?

Kimi K2.7-Code HighSpeed is available from 1 provider: Moonshot AI Kimi.

Kimi K2.7-Code HighSpeed

Name: Kimi K2.7-Code HighSpeed
Author: Moonshot AI

Released

2026-06-15

Last refreshed

2026-06-20

Status

Researched today

Open source · Commercial use: permitted · Multimodal · Coding · RAG · Agents · Long context · Vision · JSON / Tool use

Kimi K2.7-Code HighSpeed is worth evaluating for coding, RAG, and agent workloads when its provider route and context window match the workload.

Use it for

Teams evaluating coding, RAG, and agents
Workloads that can use a 262k context window
Buyers comparing 1 tracked provider route

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Kimi K2
Released: 2026-06-15
Context: 262k
Max output: 65,536
Parameters: 1T
Architecture: Mixture of Experts
Specialization: code
Openness: Open source
License: MIT (OSI-approved)
Commercial use: permitted
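
The context and max-output figures in the list above can be turned into a rough request budget check. This is a minimal sketch, not the provider's actual accounting: it assumes "262k" means exactly 262,000 tokens and that prompt and output tokens share one window, neither of which this page confirms. `fits_budget` and `max_prompt_tokens` are hypothetical helper names.

```python
CONTEXT_WINDOW = 262_000  # assumption: "262k" taken literally; the exact limit may differ
MAX_OUTPUT = 65_536       # max output tokens from the Specifications list


def fits_budget(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if a request stays within both limits.

    Assumes prompt and output tokens count against one shared window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


def max_prompt_tokens(requested_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves room for the requested output."""
    return CONTEXT_WINDOW - min(requested_output, MAX_OUTPUT)
```

Under these assumptions, reserving the full 65,536 output tokens leaves 196,464 tokens for the prompt.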

Created by

Moonshot AI

Lossless long-context AI innovation

Beijing, China

Founded 2023

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · Moonshot AI Kimi

Providers(1)

Moonshot AI Kimi

Links

Website · HuggingFace

About

Kimi K2.7-Code HighSpeed is the high-speed serving variant of Kimi K2.7-Code, optimized for throughput at the cost of latency flexibility. It was announced on June 15, 2026, three days after the standard K2.7-Code release. It delivers approximately 180 output tokens per second (up to 260 tokens/s on short-context tasks), around 6× faster than standard K2.7-Code. It uses the same underlying 1T-parameter MoE architecture (32B active parameters, 384 experts, 8 selected per token) with the MoonViT vision encoder, a 262K context window, and thinking mode always on. It is best suited for interactive or latency-bound workflows; the standard variant is preferred for correctness-sensitive long-horizon agentic work.
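
The throughput and architecture figures above lend themselves to some quick arithmetic. The sketch below is illustrative only. It assumes the "around 6×" speedup is measured against the 180 tokens/s figure, and it counts pure decode time with no queueing or time to first token. `decode_seconds` is a hypothetical helper name.

```python
# Figures quoted in the description above.
TYPICAL_RATE = 180        # output tokens per second
SHORT_CONTEXT_RATE = 260  # output tokens per second on short-context tasks
SPEEDUP = 6               # "around 6x" over standard K2.7-Code
MAX_OUTPUT = 65_536       # max output tokens
TOTAL_PARAMS = 1e12       # 1T parameters
ACTIVE_PARAMS = 32e9      # 32B active parameters
EXPERTS_TOTAL = 384
EXPERTS_SELECTED = 8


def decode_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Pure decode time; ignores queueing and time to first token."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Assumption: the 6x speedup applies to the 180 tokens/s figure.
implied_standard_rate = TYPICAL_RATE / SPEEDUP

highspeed_full = decode_seconds(MAX_OUTPUT, TYPICAL_RATE)
standard_full = decode_seconds(MAX_OUTPUT, implied_standard_rate)
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
expert_fraction = EXPERTS_SELECTED / EXPERTS_TOTAL

print(f"Implied standard rate: {implied_standard_rate:.0f} tokens/s")
print(f"Full-length output, HighSpeed: {highspeed_full / 60:.1f} min")
print(f"Full-length output, standard:  {standard_full / 60:.1f} min")
print(f"Active parameter share: {active_fraction:.1%}")
print(f"Experts selected per token: {expert_fraction:.1%}")
```

Under these assumptions, a full 65,536-token response takes roughly six minutes of decode time at 180 tokens/s, versus roughly 36 minutes at the implied 30 tokens/s of the standard variant.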

Kimi K2.7-Code HighSpeed is an open-source model in the Kimi K2 family. The structured metadata tracks a 262k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Moonshot AI Kimi. No headline benchmark score is tracked for Kimi K2.7-Code HighSpeed yet.