Using MiniMax M2.5 Highspeed on MiniMax

Implementation guide · MiniMax M2 · MiniMax

ServerlessOpen Source

Quick Start

1
Create an account at MiniMax and generate an API key.
2
Use the MiniMax SDK or REST API to call MiniMax-M2.5-highspeed — see the documentation for request format.

API Portal Documentation Model Card

Code Examples

See MiniMax documentation for integration details.

About MiniMax

MiniMax is a multimodal foundation model and API platform for text, speech, video, image, and music generation with agent tools.

View all models on MiniMax →

Pricing on MiniMax

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

About MiniMax M2.5 Highspeed

MiniMax M2.5 Highspeed is MiniMax's inference-optimized variant of M2.5, released simultaneously in February 2026. It delivers identical intelligence and outputs to standard M2.5 through a specialized inference engine at lower latency. The model supports a 204,800-token context window, 131,072-token max output, function calling, structured output, and reasoning. API model ID: MiniMax-M2.5-highspeed. It is designed for latency-sensitive interactive applications and automated agent pipelines.

Full model details →

Model Specs

Released2026-02-12

Parameters230B (10B active)

Context205k

ArchitectureDecoder Only