LLM Reference

Xiaomi MiMo-V2-Flash

Released
2025-12-17
Last refreshed
2026-06-29
Status
Researched 62d ago
ProprietaryCommercial use: conditionalRAGAgentsLong contextJSON / Tool use

Xiaomi MiMo-V2-Flash is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 262k context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family
MiMo V2
Released
2025-12-17
Context
262k
Parameters
309B
Architecture
Mixture of Experts
Knowledge cutoff
2024-12
Openness
Proprietary
License
ProprietaryCommercial use: conditional
Weights
Not released
Code
Unknown
Training
Pretrained
Created by

Consumer electronics and AI research.

Beijing, China
Founded 2010
Website
Pricing
Output / 1M
$0.300
Input / 1M
$0.100

Cheapest of 2 routes · Novita AI

About

MiMo-V2-Flash is Xiaomi's efficient open-source Mixture-of-Experts model, announced December 17, 2025 at Xiaomi's Human-Car-Home Ecosystem Partner Conference. It has 309B total parameters with 15B active, uses hybrid attention that interleaves Sliding Window Attention and Global Attention, and extends native 32K context to 256K. Multi-Token Prediction enables about 2.6x speculative decoding speedup. The model was distributed with weights on Hugging Face and ranked highly on SWE-Bench Verified and multilingual benchmarks at research time.

Xiaomi MiMo-V2-Flash is a proprietary model in the MiMo V2 family. The structured metadata tracks a 262k-token context window, reasoning, and function calling. This page tracks provider routes through Vercel AI Gateway and Novita AI, with the cheapest tracked route listed at $0.1 input and $0.3 output per 1M tokens. No headline benchmark score is tracked for Xiaomi MiMo-V2-Flash yet.

Top use-case fit: coding, agents, and build tasks

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MCacheRoute
Novita AI$0.100$0.300-
Serverless
Vercel AI Gateway$0.100$0.300read $0.010
Serverless

Capabilities

ReasoningFunction Calling

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Xiaomi MiMo-V2-Flash?

Xiaomi MiMo-V2-Flash has a context window of 262k tokens.

How much does Xiaomi MiMo-V2-Flash cost?

Xiaomi MiMo-V2-Flash is available at $0.1/1M input tokens through Vercel AI Gateway.

When was Xiaomi MiMo-V2-Flash released?

Xiaomi MiMo-V2-Flash was released on 2025-12-17.

Which providers offer Xiaomi MiMo-V2-Flash?

Xiaomi MiMo-V2-Flash is available from 2 providers: Vercel AI Gateway, Novita AI.