What is the context window of MiniMax M2.7 Highspeed?

MiniMax M2.7 Highspeed has a context window of 204,800 tokens (listed as 205k).

What is the max output of MiniMax M2.7 Highspeed?

MiniMax M2.7 Highspeed can generate up to 131,072 output tokens.

How much does MiniMax M2.7 Highspeed cost?

MiniMax M2.7 Highspeed costs $0.60 per 1M input tokens and $2.40 per 1M output tokens through Vercel AI Gateway, the cheapest tracked route.

When was MiniMax M2.7 Highspeed released?

MiniMax M2.7 Highspeed was released on 2026-03-18.

Which providers offer MiniMax M2.7 Highspeed?

MiniMax M2.7 Highspeed is available from 2 providers: MiniMax and Vercel AI Gateway.

What benchmarks has MiniMax M2.7 Highspeed been tested on?

MiniMax M2.7 Highspeed has been evaluated on 2 benchmarks: Google-Proof Q&A (GPQA Diamond) and SWE-bench Pro.

MiniMax M2.7 Highspeed

Name: MiniMax M2.7 Highspeed
Author: MiniMax

Released

2026-03-18

Last refreshed

2026-06-15

Status

Researched 53d ago

Open source · Commercial use: permitted · Coding · RAG · Agents · Long context · Classification · JSON / Tool use

MiniMax M2.7 Highspeed is worth evaluating for coding, RAG, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, RAG, and agents
Workloads that can use a 205k context window
Buyers comparing 2 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: MiniMax M2
Released: 2026-03-18
Context: 205k
Max output: 131,072
Parameters: 10B active
Architecture: Decoder Only
Specialization: general
Openness: Open source
License: MIT · OSI-approved · Commercial use: permitted

Created by

MiniMax

Developing AI for gaming and entertainment.

Minhang, Shanghai, China

Founded 2021

Pricing

Output / 1M

$2.40

Input / 1M

$0.600

Cheapest of 2 routes · Vercel AI Gateway · cache read $0.060

Providers (2)

MiniMax · Vercel AI Gateway

About

MiniMax M2.7 Highspeed is the inference-optimized variant of MiniMax M2.7, released alongside it on March 18, 2026. It reaches an output speed of 100 tokens per second, about 66% faster than standard M2.7, while preserving identical intelligence and outputs: the speedup comes from engine optimization rather than weight changes. It supports a 204,800-token context window, a 131,072-token max output, function calling, structured output, and reasoning. API model ID: MiniMax-M2.7-highspeed.
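
The throughput figures above translate into rough generation times. The Python sketch below works through that arithmetic. It assumes that "about 66% faster" means a 1.66x tokens-per-second ratio and that throughput stays constant for the whole response; the source states neither, and the function names are illustrative.

```python
# Figures quoted in the description above.
HIGHSPEED_TPS = 100.0        # output tokens per second
SPEEDUP = 0.66               # "about 66% faster" than standard M2.7
MAX_OUTPUT_TOKENS = 131_072  # maximum output length


def implied_standard_tps(highspeed_tps: float = HIGHSPEED_TPS,
                         speedup: float = SPEEDUP) -> float:
    """Back out the standard variant's speed, ASSUMING a (1 + speedup) ratio."""
    return highspeed_tps / (1.0 + speedup)


def generation_seconds(output_tokens: int, tps: float) -> float:
    """Time to stream `output_tokens` at a constant `tps` (an idealization)."""
    if tps <= 0:
        raise ValueError("tps must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return output_tokens / tps


if __name__ == "__main__":
    standard = implied_standard_tps()
    print(f"Implied standard M2.7 speed: {standard:.1f} tokens/s")
    full = generation_seconds(MAX_OUTPUT_TOKENS, HIGHSPEED_TPS)
    print(f"Full-length response at Highspeed: {full / 60:.1f} min")
```

Under these assumptions, a maximum-length response takes roughly 22 minutes at 100 tokens per second, so long generations are still worth streaming.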

MiniMax M2.7 Highspeed is an open-source model in the MiniMax M2 family. The structured metadata tracks a 205k-token context window, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through MiniMax and Vercel AI Gateway, with the cheapest tracked route listed at $0.60 input and $2.40 output per 1M tokens. Headline tracked benchmarks are Google-Proof Q&A (GPQA Diamond) at 87.4 and SWE-bench Pro at 56.2.
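
When sizing a request against these limits, it helps to check how much output room a given prompt leaves. The sketch below uses the 204,800-token context window and 131,072-token max output listed on this page. It assumes the prompt and the generated output share one context window, which is common but not confirmed here for this model; `max_output_budget` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 204_800  # tokens, as listed on this page
MAX_OUTPUT = 131_072      # tokens, as listed on this page


def max_output_budget(prompt_tokens: int) -> int:
    """Largest output length that still fits, ASSUMING a shared window.

    Returns 0 when the prompt alone fills or exceeds the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 100_000, 204_800):
        print(prompt, "->", max_output_budget(prompt))
```

Under that assumption, the full 131,072-token output is only reachable when the prompt stays at or below 73,728 tokens.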

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

1 relevant benchmark in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
Vercel AI Gateway	$0.600	$2.40	read $0.060	Serverless
MiniMax	-	-	-	Serverless (partial)
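
The per-million-token rates in the table can be turned into a per-request estimate. The sketch below uses the Vercel AI Gateway row. It assumes cached prompt tokens are billed at the cache-read rate in place of the input rate and that there are no other fees, which this page does not confirm; `estimate_cost_usd` is an illustrative name.

```python
# USD per 1M tokens, from the Vercel AI Gateway row above.
INPUT_PER_M = 0.60
OUTPUT_PER_M = 2.40
CACHE_READ_PER_M = 0.06


def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      cached_input_tokens: int = 0) -> float:
    """Estimate one request's cost in USD.

    ASSUMPTION: cached prompt tokens are charged at the cache-read rate
    instead of the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed input tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PER_M
             + cached_input_tokens * CACHE_READ_PER_M
             + output_tokens * OUTPUT_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    cold = estimate_cost_usd(100_000, 2_000)
    warm = estimate_cost_usd(100_000, 2_000, cached_input_tokens=80_000)
    print(f"no cache: ${cold:.4f}, 80k cached: ${warm:.4f}")
```

For a 100,000-token prompt with a 2,000-token reply, this comes to about $0.065 without caching and about $0.022 when 80,000 prompt tokens are cache reads.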

Capabilities

Reasoning · Function Calling · Tool Use · Structured Outputs

Benchmark peer bars for Coding

SWE-bench Pro · Rank 18 of 38

Higher-scoring peers shown: 80.3, 73.7, 69.2, 64.3

MiniMax M2.7 Highspeed (current): 56.2

Benchmark scores (2)

Scores are benchmark-specific and direction-aware: the same numeric gap can mean very different outcomes on different suites. Use the leaderboard context and this model's provider route to decide whether a score margin is meaningful for your workload.

Benchmark	Score	Version	Source
Google-Proof Q&A	87.4	GPQA Diamond (accuracy%)	https://www.minimax.io/news/minimax-m27-en
SWE-bench Pro	56.2	SWE-Pro (resolved%)	https://www.minimax.io/news/minimax-m27-en