MiniMax M3 API

MiniMax M3 is a multimodal reasoning model for coding, agents, and long-context analysis with text, image, and video input.

MiniMax | Text Generation | 524K context | Singapore | Proprietary Endpoint | New

About MiniMax M3

Supports text, image, and video inputs, adaptive thinking through enable_thinking, function calling, and implicit prompt cache reads. Pricing is tiered by prompt tokens including cache hits: <=512K uses the discounted standard tier, while >512K uses the high-context tier. Thinking tokens are billed as output tokens. Linkup web search is available through web_search_linkup and adds $0.013 per successful search.

reasoning, vision, video, web search, function calling, cache, long context
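As an illustration of image input, the sketch below builds a user message that mixes text and images using the content-part layout of the OpenAI Chat Completions format. That MiniMax M3 on EmpirioLabs accepts this exact layout is an assumption based on the endpoint being described as OpenAI-compatible; the helper names are hypothetical. Video input is left out because this page does not describe its request format.

```python
import base64


def data_url(raw: bytes, mime: str = "image/png") -> str:
    """Encode raw image bytes as a base64 data URL."""
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def image_part(url: str) -> dict:
    """One image content part in the OpenAI Chat Completions layout.

    Assumption: the endpoint accepts this layout for image input.
    """
    return {"type": "image_url", "image_url": {"url": url}}


def user_message(text: str, image_urls=()) -> dict:
    """Build a user message with one text part followed by image parts."""
    content = [{"type": "text", "text": text}]
    content.extend(image_part(url) for url in image_urls)
    return {"role": "user", "content": content}
```

The resulting dictionary can be placed in the messages list of a chat completions request in place of a plain-text message.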

MiniMax M3 specs

Model ID: minimax-m3
Provider: MiniMax
Category: Text Generation
Context window: 524K tokens
Max output: 524,288 tokens
Input: text, image, video
Output: text
Region: Singapore
Endpoints: POST /v1/chat/completions, POST /v1/responses, POST /v1/messages

MiniMax M3 API pricing (save up to 25%)

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type | Spec | List rate | <=512K (discounted) | >512K
Input | per 1M prompt tokens | $0.30 | $0.225 | $1.20
Output | per 1M generated tokens | $1.20 | $0.90 | $4.80
Implicit cache read | per 1M cached input tokens | $0.06 | $0.045 | $0.24
Linkup web search | per successful search when enabled | $0.013 | - | -
Compare on the full pricing page
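To make the tiering concrete, here is a rough cost estimator built from the rates above. It is a sketch under stated assumptions, not the billing implementation: the exact token count behind "512K" (taken here as 512,000), the assumption that the prompt token count includes cached tokens, and the assumption that cached tokens are billed only at the cache-read rate are not confirmed by this page. The function and class names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    """USD rates per 1M tokens for one pricing tier."""
    input_per_m: float
    output_per_m: float
    cache_read_per_m: float


STANDARD = Tier(input_per_m=0.225, output_per_m=0.90, cache_read_per_m=0.045)
HIGH_CONTEXT = Tier(input_per_m=1.20, output_per_m=4.80, cache_read_per_m=0.24)
LINKUP_SEARCH_FEE = 0.013  # USD per successful search

# Assumption: "512K" means 512,000 tokens; adjust if the provider uses 524,288.
TIER_THRESHOLD_TOKENS = 512_000


def estimate_cost(prompt_tokens, cached_tokens=0, output_tokens=0,
                  successful_searches=0, threshold=TIER_THRESHOLD_TOKENS):
    """Estimate the USD cost of one request.

    prompt_tokens is assumed to include cached_tokens, and output_tokens
    should include thinking tokens, which are billed as output.
    """
    if cached_tokens > prompt_tokens:
        raise ValueError("cached_tokens cannot exceed prompt_tokens")
    # The tier is chosen by prompt size, including cache hits.
    tier = STANDARD if prompt_tokens <= threshold else HIGH_CONTEXT
    uncached = prompt_tokens - cached_tokens
    token_cost = (
        uncached * tier.input_per_m
        + cached_tokens * tier.cache_read_per_m
        + output_tokens * tier.output_per_m
    ) / 1_000_000
    return token_cost + successful_searches * LINKUP_SEARCH_FEE
```

Under these assumptions, a request with 100,000 prompt tokens (20,000 of them cache hits), 5,000 output tokens, and one successful search comes to about $0.0364.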

How to call the MiniMax M3 API

MiniMax M3 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id minimax-m3. Get an API key from the EmpirioLabs dashboard.

cURL
curl https://api.empiriolabs.ai/v1/chat/completions \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax-m3",
    "messages": [
      {"role": "user", "content": "Write a haiku about the ocean."}
    ]
  }'

Python (OpenAI SDK)
from openai import OpenAI

client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key="YOUR_EMPIRIOLABS_API_KEY",
)

response = client.chat.completions.create(
    model="minimax-m3",
    messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)

Full MiniMax M3 API reference

MiniMax M3 API parameters

Request parameters supported by the MiniMax M3 API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter | Type | Default | Range / values | Description
temperature | number | 1 | 0 to 2 | Controls randomness. Lower values are more deterministic; higher values are more exploratory.
top_p | number | 0.95 | 0 to 1 | Controls nucleus sampling by limiting generation to the most likely token mass.
max_completion_tokens | integer | 4096 | 1 to 524288 | Maximum generated tokens, including reasoning tokens when thinking is enabled.
stop | array | - | - | Optional stop sequence or list of stop sequences.
enable_thinking | boolean | true | - | Enable adaptive model thinking before answering. Set false to request a direct answer without a reasoning phase.
web_search_linkup | boolean | false | - | Search the web for recent information using Linkup before MiniMax M3 answers. Adds $0.013 to successful requests when enabled.
tools | array | - | - | OpenAI-compatible tool definitions for function calling.
tool_choice | object | - | - | Optional OpenAI-compatible tool_choice value.
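A minimal sketch of how the parameters above might be assembled into a Chat Completions request body, with the documented defaults and ranges checked client-side. The helper name build_request is hypothetical, and sending enable_thinking and web_search_linkup as top-level JSON fields is an assumption that this page does not confirm.

```python
from typing import Any, Optional, Sequence, Union

MODEL_ID = "minimax-m3"
MAX_OUTPUT_TOKENS = 524_288


def build_request(
    prompt: str,
    *,
    temperature: float = 1.0,
    top_p: float = 0.95,
    max_completion_tokens: int = 4096,
    stop: Optional[Union[str, Sequence[str]]] = None,
    enable_thinking: bool = True,
    web_search_linkup: bool = False,
    tools: Optional[list] = None,
    tool_choice: Optional[Any] = None,
) -> dict:
    """Return a JSON-serializable request body using the documented defaults."""
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must be between 0 and 2")
    if not 0 <= top_p <= 1:
        raise ValueError("top_p must be between 0 and 1")
    if not 1 <= max_completion_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("max_completion_tokens must be between 1 and 524288")

    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "top_p": top_p,
        "max_completion_tokens": max_completion_tokens,
        # Assumption: provider-specific flags are top-level body fields.
        "enable_thinking": enable_thinking,
        "web_search_linkup": web_search_linkup,
    }
    # Optional fields are omitted entirely when not set.
    if stop is not None:
        body["stop"] = [stop] if isinstance(stop, str) else list(stop)
    if tools:
        body["tools"] = tools
    if tool_choice is not None:
        body["tool_choice"] = tool_choice
    return body
```

The returned dictionary can be sent as the JSON body of POST /v1/chat/completions. With the OpenAI Python SDK, fields that are not part of the standard method signature, such as enable_thinking and web_search_linkup, are typically passed through the extra_body argument of client.chat.completions.create.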

Good to know

Text, image, and video input with text output. Thinking defaults to adaptive and can be disabled. Pricing is based on input tokens including cache hits: <=512K uses the discounted standard tier, while >512K uses the high-context tier. Current standard route supports up to 524K tokens. 1M context is rolling out. Linkup web search is available through web_search_linkup and adds $0.013 per successful search.

MiniMax M3 API: common questions

How much does the MiniMax M3 API cost?

On EmpirioLabs, MiniMax M3 is billed pay-as-you-go, with rates tiered by prompt size. For prompts of <=512K tokens, input costs $0.225 per 1M prompt tokens (was $0.30), output costs $0.90 per 1M generated tokens (was $1.20), and implicit cache reads cost $0.045 per 1M cached input tokens (was $0.06). For prompts of >512K tokens, the rates are $1.20 per 1M prompt tokens, $4.80 per 1M generated tokens, and $0.24 per 1M cached input tokens. The live rate card on this page always matches what the API charges.

What is the context window of MiniMax M3?

MiniMax M3 supports a 524K-token context window with up to 524,288 output tokens per response.

Is the MiniMax M3 API OpenAI-compatible?

Yes. MiniMax M3 serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to minimax-m3.

Can I try MiniMax M3 in the browser before integrating?

Yes. The EmpirioLabs playground runs MiniMax M3 in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a MiniMax M3 API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.
