Xiaomi MiMo-V2-Flash
Xiaomi MiMo-V2-Flash is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.
Use it for
- Teams evaluating rag, agents, and long context
- Workloads that can use a 262k context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- MiMo V2
- Released
- 2025-12-17
- Context
- 262k
- Parameters
- 309B
- Architecture
- Mixture of Experts
- Knowledge cutoff
- 2024-12
- Openness
- Proprietary
- License
- ProprietaryCommercial use: conditional
- Weights
- Not released
- Code
- Unknown
- Training
- Pretrained
Cheapest of 2 routes · Novita AI
About
MiMo-V2-Flash is Xiaomi's efficient open-source Mixture-of-Experts model, announced December 17, 2025 at Xiaomi's Human-Car-Home Ecosystem Partner Conference. It has 309B total parameters with 15B active, uses hybrid attention that interleaves Sliding Window Attention and Global Attention, and extends native 32K context to 256K. Multi-Token Prediction enables about 2.6x speculative decoding speedup. The model was distributed with weights on Hugging Face and ranked highly on SWE-Bench Verified and multilingual benchmarks at research time.
Xiaomi MiMo-V2-Flash is a proprietary model in the MiMo V2 family. The structured metadata tracks a 262k-token context window, reasoning, and function calling. This page tracks provider routes through Vercel AI Gateway and Novita AI, with the cheapest tracked route listed at $0.1 input and $0.3 output per 1M tokens. No headline benchmark score is tracked for Xiaomi MiMo-V2-Flash yet.
Top use-case fit: coding, agents, and build tasks
RAG
Included by capability and metadata signals in the decision map.
Agents
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 2Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Cache | Route |
|---|---|---|---|---|
| Novita AI | $0.100 | $0.300 | - | Serverless |
| Vercel AI Gateway | $0.100 | $0.300 | read $0.010 | Serverless |
Capabilities
Benchmark peer barsfor RAG
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of Xiaomi MiMo-V2-Flash?
Xiaomi MiMo-V2-Flash has a context window of 262k tokens.
How much does Xiaomi MiMo-V2-Flash cost?
Xiaomi MiMo-V2-Flash is available at $0.1/1M input tokens through Vercel AI Gateway.
When was Xiaomi MiMo-V2-Flash released?
Xiaomi MiMo-V2-Flash was released on 2025-12-17.
Which providers offer Xiaomi MiMo-V2-Flash?
Xiaomi MiMo-V2-Flash is available from 2 providers: Vercel AI Gateway, Novita AI.
Cheapest of 2 routes · Novita AI