
MiniMax M3 is a multimodal reasoning model for coding, agents, and long-context analysis with text, image, and video input.
MiniMax M3 is a multimodal reasoning model for coding, agents, and long-context analysis with text, image, and video input.
Supports text, image, and video inputs, adaptive thinking through enable_thinking, function calling, and implicit prompt cache reads. Pricing is tiered by prompt tokens including cache hits: <=512K uses the discounted standard tier, while >512K uses the high-context tier. Thinking tokens are billed as output tokens. Linkup web search is available through web_search_linkup and adds $0.013 per successful search.
minimax-m3POST /v1/chat/completionsPOST /v1/responsesPOST /v1/messagesLive pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.
MiniMax M3 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id minimax-m3. Get an API key from the EmpirioLabs dashboard.
curl https://api.empiriolabs.ai/v1/chat/completions \
-H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "minimax-m3",
"messages": [
{"role": "user", "content": "Write a haiku about the ocean."}
]
}'from openai import OpenAI
client = OpenAI(
base_url="https://api.empiriolabs.ai/v1",
api_key="YOUR_EMPIRIOLABS_API_KEY",
)
response = client.chat.completions.create(
model="minimax-m3",
messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)Request parameters supported by the MiniMax M3 API on EmpirioLabs. Defaults apply when a field is omitted.
| Parameter | Type | Default | Range / values | Description |
|---|---|---|---|---|
| temperature | number | 1 | 0 to 2 | Controls randomness. Lower values are more deterministic; higher values are more exploratory. |
| top_p | number | 0.95 | 0 to 1 | Controls nucleus sampling by limiting generation to the most likely token mass. |
| max_completion_tokens | integer | 4096 | 1 to 524288 | Maximum generated tokens, including reasoning tokens when thinking is enabled. |
| stop | array | - | - | Optional stop sequence or list of stop sequences. |
| enable_thinking | boolean | true | - | Enable adaptive model thinking before answering. Set false to request a direct answer without a reasoning phase. |
| web_search_linkup | boolean | false | - | Search the web for recent information using Linkup before MiniMax M3 answers. Adds $0.013 to successful requests when enabled. |
| tools | array | - | - | OpenAI-compatible tool definitions for function calling. |
| tool_choice | object | - | - | Optional OpenAI-compatible tool_choice value. |
Text, image, and video input with text output. Thinking defaults to adaptive and can be disabled. Pricing is based on input tokens including cache hits: <=512K uses the discounted standard tier, while >512K uses the high-context tier. Current standard route supports up to 524K tokens. 1M context is rolling out. Linkup web search is available through web_search_linkup and adds $0.013 per successful search.
On EmpirioLabs, MiniMax M3 is billed pay as you go: Input <=512K $0.225 (was $0.30); >512K $1.20 per 1M prompt tokens; Output <=512K $0.90 (was $1.20); >512K $4.80 per 1M generated tokens; Implicit cache read <=512K $0.045 (was $0.06); >512K $0.24 per 1M cached input tokens. The live rate card on this page always matches what the API charges.
MiniMax M3 supports a 524K-token context window with up to 524,288 output tokens per response.
Yes. MiniMax M3 serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to minimax-m3.
Yes. The EmpirioLabs playground runs MiniMax M3 in the browser with the same parameters the API exposes, so you can test prompts before writing code.
Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.
Explore our models, or contact us about business inquiries, custom deployments, or anything else.