Xiaomi: MiMo-V2-Omni

Name: Xiaomi: MiMo-V2-Omni
Brand: xiaomi
Availability: OutOfStock
Rating: 4.2 (8 reviews)

Back

Audio input Image input Video input Text input Text output Unavailable

Author's Description

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

Key Specifications

Cost

$$$$$

Context

262K

Released

Mar 18, 2026

Speed

★★

Ability

★★★★★

Reliability

★★★★

Supported Parameters

This model supports the following parameters:

Frequency Penalty Response Format Include Reasoning Reasoning Temperature Presence Penalty Max Tokens Stop Tools Tool Choice Top P

Features

This model supports the following features:

Tools Reasoning Response Format

Performance Summary

Xiaomi's MiMo-V2-Omni demonstrates moderate speed performance, ranking in the 35th percentile across benchmarks. Its pricing tends to be at premium levels, positioned in the 14th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all evaluated benchmarks, indicating consistent and dependable operation. The model exhibits strong capabilities across several critical areas. It achieves perfect accuracy in both General Knowledge and Ethics, with the former also being the most accurate model at its price point and speed. Its Mathematics performance is particularly impressive, scoring 97.0% accuracy and ranking in the 97th percentile. Reasoning also shows high proficiency at 98.0% accuracy, placing it in the 90th percentile. While its Hallucinations score of 94.0% is respectable, it falls in the 47th percentile, suggesting some room for improvement in acknowledging uncertainty. Instruction Following and Coding are solid at 69.0% and 92.0% respectively, both ranking in the 73rd percentile. Email Classification is competent at 98.0% accuracy. Overall, MiMo-V2-Omni is a robust omni-modal model with significant strengths in complex reasoning, mathematical problem-solving, and ethical considerations, underpinned by its high reliability. Its primary areas for potential enhancement lie in reducing hallucinations and optimizing its premium cost structure.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.4
Completion	$2
Input Cache Read	$0.08

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
Xiaomi	Xiaomi \| xiaomi/mimo-v2-omni-20260318	262K	$0.4 / 1M tokens	$2 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by xiaomi

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
Xiaomi: MiMo-V2.5-Pro Unavailable	Apr 22, 2026	—	1M	Text input Text output	★★	★★★★	$$$$$
Xiaomi: MiMo-V2.5-Pro	Apr 22, 2026	—	1M	Text input Text output	★	★★★★★	$$$$$
Xiaomi: MiMo-V2.5 Unavailable	Apr 22, 2026	—	1M	Audio input Image input Video input Text input Text output	★★★	★★★★★	$$$$$
Xiaomi: MiMo-V2.5	Apr 22, 2026	—	1M	Audio input Image input Video input Text input Text output	★★	★★★★★	$$$$$
Xiaomi: MiMo-V2-Pro Unavailable	Mar 18, 2026	~1T	1M	Text input Text output	★	★★★★★	$$$$$
Xiaomi: MiMo-V2-Flash Unavailable	Dec 14, 2025	~309B	262K	Text input Text output	★★★★	★★★	$$