MiMo V2 Models by Xiaomi
XiaomiProprietary
3 models2025–2026Up to 1.05m ctxFrom $0.1/1M input
Details
ResearcherXiaomi
LicenseProprietary
Commercial useCommercial use: conditional
Models3
Released2025–2026
Max context1.05m
Capabilities
Vision1 of 3 models
Multimodal1 of 3 models
Reasoning1 of 3 models
Function Calling1 of 3 models
About
MiMo V2 is a family of 3 AI models by Xiaomi, released between 2025 and 2026.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
3 in view
MiMo-V2-OmniCurrent
Use when the workload needs audio, 262k context, and multimodal inputs.
2026-03audio262k contextmultimodal inputs
Xiaomi MiMo-V2-FlashCurrent
Use when the workload needs 262k context, 309B parameters, and reasoning.
2025-12262k context309B parametersreasoning
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| MiMo-V2-Omni | Use when the workload needs audio, 262k context, and multimodal inputs. | 2026-03 | audio262k contextmultimodal inputs | Current |
| MiMo-V2-Pro | Use when the workload needs 1.05m context. | 2026-03 | 1.05m context | Current |
| Xiaomi MiMo-V2-Flash | Use when the workload needs 262k context, 309B parameters, and reasoning. | 2025-12 | 262k context309B parametersreasoning | Current |
Release Timeline
2 release groups2026-03
2 current
MiMo-V2-Omni
Currentaudio262k contextmultimodal inputs
MiMo-V2-Pro
Current1.05m context
2025-12
1 current
Xiaomi MiMo-V2-Flash
Current262k context309B parametersreasoning
Specifications(3 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Reasoning | Fn Calling |
|---|---|---|---|---|---|---|---|
| MiMo-V2-Omni | 2026-03 | 262k | — | Yes | Yes | No | No |
| MiMo-V2-Pro | 2026-03 | 1.05m | — | No | No | No | No |
| Xiaomi MiMo-V2-Flash | 2025-12 | 262k | 309B | No | No | Yes | Yes |
Available From(3 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Xiaomi MiMo-V2-Flash | Vercel AI Gateway | $0.1 | $0.3 | Serverless |
| Xiaomi MiMo-V2-Flash | Novita AI | $0.1 | $0.3 | Serverless |
| MiMo-V2-Omni | OpenRouter | $0.4 | $2 | Serverless |
| MiMo-V2-Pro | OpenRouter | $1 | $3 | Serverless |
| MiMo-V2-Pro | Vercel AI Gateway | $1 | $3 | Serverless |
Frequently Asked Questions
- What is MiMo V2 used for?
- MiMo V2 is used for audio, vision and multimodal work, and reasoning. The family description and listed model capabilities point to those workloads as the best fit.
- How does MiMo V2 compare to MiMo?
- MiMo V2 by Xiaomi is strongest where you need audio, while MiMo by Xiaomi is the closest related family to check for text to speech. MiMo V2 has 3 listed variants and reaches up to 1.05m context, while MiMo reaches up to 1.05m context, so compare the specs and pricing tables before choosing a production model.
- Which MiMo V2 model should I use?
- For the lowest listed input price, start with Xiaomi MiMo-V2-Flash through Novita AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate MiMo-V2-Omni with 262k context and multimodal inputs.

