MiniMax M2.5 Highspeed on Vercel AI Gateway

Name: MiniMax M2.5 Highspeed on Vercel AI Gateway
Brand: MiniMax
SKU: minimax-m2.5-highspeed-vercel-ai-gateway
Price: 0.6 USD

MiniMax M2 · MiniMax

ServerlessOpen Source

Last refreshed 2026-06-29. Next refresh: weekly.

Why use MiniMax M2.5 Highspeed on Vercel AI Gateway?

Vercel AI Gateway offers MiniMax M2.5 Highspeed with pay-as-you-go pricing at $0.60/1M input tokens. Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more.

Compare MiniMax M2.5 Highspeed across 3 providers to find the best fit for your use case

Input / 1M

$0.60

Output / 1M

$2.40

Cache

read $0.030

Batch

Not sourced

Setup recipe

Python + curl

Install

pip install openai

Auth

export AI_GATEWAY_API_KEY=...

Call

import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],

Model ID

minimax/minimax-m2.5-highspeed

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1"
)
response = client.chat.completions.create(
    model="minimax/minimax-m2.5-highspeed",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

Use provider model ID "minimax/minimax-m2.5-highspeed", not the LLMReference slug "minimax-m2.5-highspeed".
creator/model-name e.g. kwaipilot/kat-coder-pro-v2
The examples expect AI_GATEWAY_API_KEY; rename it only if your application config maps the new variable.

Compare MiniMax M2.5 Highspeed Across Providers

Provider	Input (per 1M)	Output (per 1M)
MiniMax	—	—
Vercel AI Gateway	$0.60	$2.40
Novita AI	$0.60	$2.40

Pricing

Type	Price (per 1M)
Input tokens	$0.60
Output tokens	$2.40

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

About MiniMax M2.5 Highspeed

MiniMax M2.5 Highspeed is MiniMax's inference-optimized variant of M2.5, released simultaneously in February 2026. It delivers identical intelligence and outputs to standard M2.5 through a specialized inference engine at lower latency. The model supports a 204,800-token context window, 131,072-token max output, function calling, structured output, and reasoning. API model ID: MiniMax-M2.5-highspeed. It is designed for latency-sensitive interactive applications and automated agent pipelines.

FAQ

What does MiniMax M2.5 Highspeed cost on Vercel AI Gateway?

On Vercel AI Gateway, MiniMax M2.5 Highspeed costs $0.6 per 1M input tokens and $2.4 per 1M output tokens.

What is the context window for MiniMax M2.5 Highspeed on Vercel AI Gateway?

MiniMax M2.5 Highspeed supports a 205k token context window on Vercel AI Gateway.

How does Vercel AI Gateway compare to other MiniMax M2.5 Highspeed providers?

MiniMax M2.5 Highspeed is available from 3 providers. The cheapest input pricing is $0.6/1M tokens from Vercel AI Gateway.

What API model ID do I use for MiniMax M2.5 Highspeed on Vercel AI Gateway?

Use the model ID minimax/minimax-m2.5-highspeed when calling Vercel AI Gateway's API.

Who created MiniMax M2.5 Highspeed?

MiniMax M2.5 Highspeed was created by MiniMax as part of the MiniMax M2 model family.

Is MiniMax M2.5 Highspeed open source?

MiniMax M2.5 Highspeed is open source under MIT according to the seed data.

Get Started

Model Card Docs Playground Pricing

Model Specs

Released2026-02-12

Parameters230B (10B active)

Context205k

ArchitectureDecoder Only

Vercel

All models on Vercel AI Gateway →Provider setup guide →