MiniMax M3

MiniMax M3 is a multimodal foundation model with a 1M-token context window, native multimodal understanding, and MiniMax Sparse Attention (MSA) for efficient long-context inference.

minimax-m3
STABLE
1,048,576 context
Starting at $0.60/M input tokens
Starting at $2.40/M output tokens
Streaming
Vision
Tools
Reasoning
JSON Output

All Providers for MiniMax M3

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

MiniMax
Context: 1.0M
Input: $0.60/M tokens
Cached: $0.12/M tokens
Output: $2.40/M tokens
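
To make the per-million-token rates concrete, here is a minimal sketch of a cost estimate for a single request using the MiniMax prices listed above. The function name estimate_cost and its parameters are hypothetical, and the sketch assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate; the listing does not spell out the billing rules, so treat this as an approximation.

```python
# Prices in USD per million tokens, as listed for the MiniMax provider.
INPUT_PER_M = 0.60
CACHED_PER_M = 0.12
OUTPUT_PER_M = 2.40


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached tokens are part of the input and are billed at the
    cached rate instead of the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHED_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    # Rates are quoted per million tokens, so scale down at the end.
    return total / 1_000_000


if __name__ == "__main__":
    cost = estimate_cost(input_tokens=200_000, output_tokens=4_000, cached_tokens=50_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a request with 200,000 input tokens (50,000 of them cached) and 4,000 output tokens comes to roughly $0.11.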