LLM ReferenceLLM Reference
MiniMax

Using MiniMax M2.5 Highspeed on MiniMax

Implementation guide · MiniMax M2 · MiniMax

Serverless

Quick Start

  1. 1
    Create an account at MiniMax and generate an API key.
  2. 2
    Use the MiniMax SDK or REST API to call MiniMax-M2.5-highspeed — see the documentation for request format.

Code Examples

See MiniMax documentation for integration details.

About MiniMax

MiniMax is a multimodal foundation model and API platform for text, speech, video, image, and music generation with agent tools.

Pricing on MiniMax

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About MiniMax M2.5 Highspeed

MiniMax M2.5 Highspeed is MiniMax's inference-optimized variant of M2.5, released simultaneously in February 2026. It delivers identical intelligence and outputs to standard M2.5 through a specialized inference engine at lower latency. The model supports a 204,800-token context window, 131,072-token max output, function calling, structured output, and reasoning. API model ID: MiniMax-M2.5-highspeed. It is designed for latency-sensitive interactive applications and automated agent pipelines.

Model Specs

Released2026-02-12
Context205K
ArchitectureDecoder Only

Provider

MiniMax
MiniMax