LLM Reference

MiniMax M2.5 Highspeed on Vercel AI Gateway

MiniMax M2 · MiniMax

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use MiniMax M2.5 Highspeed on Vercel AI Gateway?

Vercel AI Gateway offers MiniMax M2.5 Highspeed with pay-as-you-go pricing at $0.60/1M input tokens. Vercel AI Gateway is a unified AI proxy providing a single OpenAI-compatible API endpoint to 275+ models from 25+ providers including Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, xAI, Alibaba, Amazon, ByteDance, Cohere, MiniMax, MoonshotAI, KwaiPilot, Black Forest Labs, Recraft, Voyage AI, NVIDIA, and more.

Compare MiniMax M2.5 Highspeed across 3 providers to find the best fit for your use case
Input / 1M
$0.60
Output / 1M
$2.40
Cache
read $0.030
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export AI_GATEWAY_API_KEY=...
Call
import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
Model ID
minimax/minimax-m2.5-highspeed

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["AI_GATEWAY_API_KEY"],
    base_url="https://ai-gateway.vercel.sh/v1"
)
response = client.chat.completions.create(
    model="minimax/minimax-m2.5-highspeed",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • Use provider model ID "minimax/minimax-m2.5-highspeed", not the LLMReference slug "minimax-m2.5-highspeed".
  • creator/model-name e.g. kwaipilot/kat-coder-pro-v2
  • The examples expect AI_GATEWAY_API_KEY; rename it only if your application config maps the new variable.

Compare MiniMax M2.5 Highspeed Across Providers

ProviderInput (per 1M)Output (per 1M)
MiniMax
Vercel AI Gateway$0.60$2.40
Novita AI$0.60$2.40

Pricing

TypePrice (per 1M)
Input tokens$0.60
Output tokens$2.40

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

About MiniMax M2.5 Highspeed

MiniMax M2.5 Highspeed is MiniMax's inference-optimized variant of M2.5, released simultaneously in February 2026. It delivers identical intelligence and outputs to standard M2.5 through a specialized inference engine at lower latency. The model supports a 204,800-token context window, 131,072-token max output, function calling, structured output, and reasoning. API model ID: MiniMax-M2.5-highspeed. It is designed for latency-sensitive interactive applications and automated agent pipelines.

FAQ

What does MiniMax M2.5 Highspeed cost on Vercel AI Gateway?

On Vercel AI Gateway, MiniMax M2.5 Highspeed costs $0.6 per 1M input tokens and $2.4 per 1M output tokens.

What is the context window for MiniMax M2.5 Highspeed on Vercel AI Gateway?

MiniMax M2.5 Highspeed supports a 204,800 token context window on Vercel AI Gateway.

How does Vercel AI Gateway compare to other MiniMax M2.5 Highspeed providers?

MiniMax M2.5 Highspeed is available from 3 providers. The cheapest input pricing is $0.6/1M tokens from Vercel AI Gateway.

What API model ID do I use for MiniMax M2.5 Highspeed on Vercel AI Gateway?

Use the model ID minimax/minimax-m2.5-highspeed when calling Vercel AI Gateway's API.

Who created MiniMax M2.5 Highspeed?

MiniMax M2.5 Highspeed was created by MiniMax as part of the MiniMax M2 model family.

Is MiniMax M2.5 Highspeed open source?

MiniMax M2.5 Highspeed is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2026-02-12
Parameters230B (10B active)
Context205K
ArchitectureDecoder Only