LLM Reference

MiMo V2 Models by Xiaomi

Xiaomi · Proprietary
3 models · 2025–2026 · Up to 1.05m context · From $0.1/1M input tokens

Details

Researcher: Xiaomi
License: Proprietary
Commercial use: conditional
Models: 3
Released: 2025–2026
Max context: 1.05m

Capabilities

Vision: 1 of 3 models
Multimodal: 1 of 3 models
Reasoning: 1 of 3 models
Function Calling: 1 of 3 models

About

MiMo V2 is a family of 3 AI models by Xiaomi, released between 2025 and 2026.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

MiMo-V2-Omni (2026-03): use when the workload needs audio, 262k context, and multimodal inputs.

MiMo-V2-Pro (2026-03): use when the workload needs 1.05m context.

Xiaomi MiMo-V2-Flash (2025-12): use when the workload needs 262k context, a 309B-parameter model, and reasoning.

Release Timeline

2 release groups

2026-03 (2 current)
MiMo-V2-Omni: audio, 262k context, multimodal inputs (Current)
MiMo-V2-Pro: 1.05m context (Current)

2025-12 (1 current)
Xiaomi MiMo-V2-Flash: 262k context, 309B parameters, reasoning (Current)

Specifications (3 models)

MiMo V2 model specifications comparison

Model                 Released  Context  Parameters  Vision  Multimodal  Reasoning  Fn Calling
MiMo-V2-Omni          2026-03   262k     -           Yes     Yes         No         No
MiMo-V2-Pro           2026-03   1.05m    -           No      No          No         No
Xiaomi MiMo-V2-Flash  2025-12   262k     309B        No      No          Yes        Yes

Available From (3 providers)

Pricing

MiMo V2 model pricing by provider

Model                 Provider           Input / 1M  Output / 1M  Type
Xiaomi MiMo-V2-Flash  Vercel AI Gateway  $0.1        $0.3         Serverless
Xiaomi MiMo-V2-Flash  Novita AI          $0.1        $0.3         Serverless
MiMo-V2-Omni          OpenRouter         $0.4        $2           Serverless
MiMo-V2-Pro           OpenRouter         $1          $3           Serverless
MiMo-V2-Pro           Vercel AI Gateway  $1          $3           Serverless
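
As a rough illustration of how the per-1M-token prices above translate into a bill, the following Python sketch estimates the cost of a workload. The prices are copied from the pricing table; the `PRICES` dictionary, the `estimate_cost` helper, and the example token counts are hypothetical names and values chosen for illustration, not part of any provider's API.

```python
# USD per 1M tokens (input, output), copied from the pricing table above.
# The dictionary layout and helper name are illustrative assumptions.
PRICES = {
    ("Xiaomi MiMo-V2-Flash", "Vercel AI Gateway"): (0.1, 0.3),
    ("Xiaomi MiMo-V2-Flash", "Novita AI"): (0.1, 0.3),
    ("MiMo-V2-Omni", "OpenRouter"): (0.4, 2.0),
    ("MiMo-V2-Pro", "OpenRouter"): (1.0, 3.0),
    ("MiMo-V2-Pro", "Vercel AI Gateway"): (1.0, 3.0),
}


def estimate_cost(model: str, provider: str,
                  input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload."""
    input_price, output_price = PRICES[(model, provider)]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    cost = estimate_cost("MiMo-V2-Omni", "OpenRouter", 2_000_000, 500_000)
    print(f"Estimated cost: ${cost:.2f}")
```

Because output tokens cost more than input tokens for every listed model, generation-heavy workloads shift the comparison more than the headline input price suggests.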

Frequently Asked Questions

What is MiMo V2 used for?
MiMo V2 is used for audio, vision and multimodal work, and reasoning. The family description and listed model capabilities point to those workloads as the best fit.

How does MiMo V2 compare to MiMo?
MiMo V2 by Xiaomi is strongest where you need audio, while MiMo by Xiaomi is the closest related family to check for text to speech. MiMo V2 has 3 listed variants, and both families reach up to 1.05m context, so compare the specs and pricing tables before choosing a production model.

Which MiMo V2 model should I use?
For the lowest listed input price, start with Xiaomi MiMo-V2-Flash, listed at $0.1/1M input tokens through both Novita AI and Vercel AI Gateway. For the latest release with multimodal inputs, evaluate MiMo-V2-Omni with 262k context.
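
The selection advice above can be sketched as a small filter over the specification and pricing tables. This is a minimal Python sketch under stated assumptions: the `MODELS` list and `candidates` helper are hypothetical, the context sizes 262k and 1.05m are approximated as 262,000 and 1,050,000 tokens (the page does not give exact counts), and each model's price is the lowest listed input price across providers.

```python
# Data transcribed from the specification and pricing tables on this page.
# Context sizes are approximations of "262k" and "1.05m" (an assumption).
MODELS = [
    {"name": "MiMo-V2-Omni", "context": 262_000,
     "caps": {"audio", "vision", "multimodal"}, "min_input_price": 0.4},
    {"name": "MiMo-V2-Pro", "context": 1_050_000,
     "caps": set(), "min_input_price": 1.0},
    {"name": "Xiaomi MiMo-V2-Flash", "context": 262_000,
     "caps": {"reasoning", "function_calling"}, "min_input_price": 0.1},
]


def candidates(min_context=0, required_caps=()):
    """Return names of models that meet the requirements, cheapest first."""
    required = set(required_caps)
    matches = [
        m for m in MODELS
        if m["context"] >= min_context and required <= m["caps"]
    ]
    matches.sort(key=lambda m: m["min_input_price"])
    return [m["name"] for m in matches]


if __name__ == "__main__":
    print(candidates(required_caps={"reasoning"}))
    print(candidates(min_context=500_000))
    print(candidates(required_caps={"multimodal"}))
```

Sorting by lowest listed input price is only one possible tie-breaker; a production choice would also weigh output price and provider availability.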