Xiaomi MiMo-V2-Omni
Proprietary, Multimodal
About
Multimodal model supporting text, image, video, and audio. Pricing: $0.40/$2.00 per million tokens.
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution