Xiaomi MiMo-V2-Flash

Name: Xiaomi MiMo-V2-Flash
Author: Xiaomi

Released

2025-12-17

Last refreshed

2026-06-29

Status

Researched 62d ago

ProprietaryCommercial use: conditionalRAGAgentsLong contextJSON / Tool use

Xiaomi MiMo-V2-Flash is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

Teams evaluating rag, agents, and long context
Workloads that can use a 262k context window
Buyers comparing 2 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: MiMo V2
Released: 2025-12-17
Context: 262k
Parameters: 309B
Architecture: Mixture of Experts
Knowledge cutoff: 2024-12
Openness: Proprietary
License: ProprietaryCommercial use: conditional
Weights: Not released
Code: Unknown
Training: Pretrained

Created by

Xiaomi

Consumer electronics and AI research.

Beijing, China

Founded 2010

Website

Pricing

Output / 1M

$0.300

Input / 1M

$0.100

Cheapest of 2 routes · Novita AI

Providers(2)

Vercel AI Gateway Novita AI

View 2 provider routes

Links

Website HuggingFace

About

MiMo-V2-Flash is Xiaomi's efficient open-source Mixture-of-Experts model, announced December 17, 2025 at Xiaomi's Human-Car-Home Ecosystem Partner Conference. It has 309B total parameters with 15B active, uses hybrid attention that interleaves Sliding Window Attention and Global Attention, and extends native 32K context to 256K. Multi-Token Prediction enables about 2.6x speculative decoding speedup. The model was distributed with weights on Hugging Face and ranked highly on SWE-Bench Verified and multilingual benchmarks at research time.

Xiaomi MiMo-V2-Flash is a proprietary model in the MiMo V2 family. The structured metadata tracks a 262k-token context window, reasoning, and function calling. This page tracks provider routes through Vercel AI Gateway and Novita AI, with the cheapest tracked route listed at $0.1 input and $0.3 output per 1M tokens. No headline benchmark score is tracked for Xiaomi MiMo-V2-Flash yet.

Top use-case fit: coding, agents, and build tasks

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
Novita AI	$0.100	$0.300	-	Serverless
Vercel AI Gateway	$0.100	$0.300	read $0.010	Serverless

Capabilities

ReasoningFunction Calling

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Xiaomi MiMo-V2-Flash?

Xiaomi MiMo-V2-Flash has a context window of 262k tokens.

How much does Xiaomi MiMo-V2-Flash cost?

Xiaomi MiMo-V2-Flash is available at $0.1/1M input tokens through Vercel AI Gateway.

When was Xiaomi MiMo-V2-Flash released?

Xiaomi MiMo-V2-Flash was released on 2025-12-17.

Which providers offer Xiaomi MiMo-V2-Flash?

Xiaomi MiMo-V2-Flash is available from 2 providers: Vercel AI Gateway, Novita AI.

Created by

Xiaomi

Consumer electronics and AI research.

Beijing, China

Founded 2010

Website

Pricing

Output / 1M

$0.300

Input / 1M

$0.100

Cheapest of 2 routes · Novita AI

Providers(2)

Vercel AI Gateway Novita AI

View 2 provider routes

Links

Website HuggingFace