LLM Reference

MiMo-V2-Omni

Released
2026-03-18
Last refreshed
2026-06-15
Status
Researched 45d ago
ProprietaryCommercial use: conditionalMultimodalLong contextVision

MiMo-V2-Omni is worth evaluating for long context and vision when its provider route and context window match the workload.

Use it for

  • Teams evaluating long context and vision
  • Workloads that can use a 262k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Strict JSON or tool-calling flows
Specifications
Family
MiMo V2
Released
2026-03-18
Context
262k
Knowledge cutoff
2024-12
Specialization
audio
Openness
Proprietary
License
ProprietaryCommercial use: conditional
Created by

Consumer electronics and AI research.

Beijing, China
Founded 2010
Website
Pricing
Output / 1M
$2.00
Input / 1M
$0.400

Cheapest of 1 route · OpenRouter

About

Xiaomi MiMo-V2-Omni multimodal language model. Part of the MiMo V2 series; the Omni variant adds multimodal (image) understanding. Distinct from MiMo V2.5 which focuses on math reasoning.

MiMo-V2-Omni is a proprietary model in the MiMo V2 family. The structured metadata tracks a 262k-token context window, multimodal input, and audio. This page tracks provider routes through OpenRouter, with the cheapest tracked route listed at $0.4 input and $2 output per 1M tokens. No headline benchmark score is tracked for MiMo-V2-Omni yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
OpenRouter$0.400$2.00
Serverless

Capabilities

VisionMultimodalAudio

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.