MiMo V2.5

Xiaomi's omni-modal perception model, with native understanding of images, video, audio, and text and a 1M-token context window. Its agent performance is comparable to that of MiMo V2.5 Pro.

xiaomi/mimo-v2.5
Status: Stable
Capabilities: Streaming, Vision, Tools, Reasoning, JSON Output

Xiaomi Pricing for MiMo V2.5

Provider: Xiaomi
Context: 1M
Input: $0.14 /M tokens
Cached: $0.028 /M tokens
Output: $0.28 /M tokens
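
To make the per-million-token rates concrete, here is a minimal sketch of a cost estimate for a single request. The prices are the ones listed for the Xiaomi provider; the function name `estimate_cost` is hypothetical, and the billing rule (cached tokens are a subset of the input tokens and are charged at the cached rate instead of the input rate) is an assumption that the listing does not confirm.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the Xiaomi provider pricing in this listing.
PRICE_PER_M = {
    "input": Decimal("0.14"),
    "cached": Decimal("0.028"),
    "output": Decimal("0.28"),
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): cached_tokens is the part of
    input_tokens served from cache, billed at the cached rate instead of
    the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * PRICE_PER_M["input"]
        + cached_tokens * PRICE_PER_M["cached"]
        + output_tokens * PRICE_PER_M["output"]
    )
    return total_per_million / Decimal(1_000_000)


if __name__ == "__main__":
    # 200k prompt tokens (150k of them cached) and 5k completion tokens.
    print(estimate_cost(200_000, 5_000, cached_tokens=150_000))
```

Under these assumptions, a request with 200,000 input tokens (150,000 of them cached) and 5,000 output tokens comes to about $0.0126, and one million input tokens plus one million output tokens with no caching comes to $0.42. `Decimal` is used instead of floats so that small per-token amounts do not pick up rounding error.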