
MiniMax M2.7 is a general-purpose reasoning chat model with interleaved thinking, function calling, and prompt caching.
MiniMax M2.7 is a general-purpose reasoning chat model with interleaved thinking, function calling, and prompt caching.
Supports streaming, interleaved thinking, function calling, tool_choice, max_tokens, stop, temperature, top_p, and implicit prompt cache reads. Thinking tokens are billed as output tokens.
minimax-m2-7POST /v1/chat/completionsPOST /v1/responsesPOST /v1/messagesLive pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.
MiniMax M2.7 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id minimax-m2-7. Get an API key from the EmpirioLabs dashboard.
curl https://api.empiriolabs.ai/v1/chat/completions \
-H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "minimax-m2-7",
"messages": [
{"role": "user", "content": "Write a haiku about the ocean."}
]
}'from openai import OpenAI
client = OpenAI(
base_url="https://api.empiriolabs.ai/v1",
api_key="YOUR_EMPIRIOLABS_API_KEY",
)
response = client.chat.completions.create(
model="minimax-m2-7",
messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)Request parameters supported by the MiniMax M2.7 API on EmpirioLabs. Defaults apply when a field is omitted.
| Parameter | Type | Default | Range / values | Description |
|---|---|---|---|---|
| temperature | number | 1 | 0 to 2 | Sampling temperature. 0 = deterministic, 2 = maximum randomness. |
| top_p | number | 0.95 | 0 to 1 | Nucleus sampling probability mass. Lower = more focused. |
| max_tokens | number | 4096 | 1 to 131072 | Maximum tokens in the response. |
| stop | string | - | - | Up to 4 strings where the model will stop generating further tokens. |
| tools | array | - | - | OpenAI-style function-calling tool definitions. Each entry has name, description, parameters. |
| tool_choice | string | - | - | auto | none | required | {type:function, function:{name:"..."}}. Controls when the model must call a tool. |
| web_search_linkup | boolean | false | - | Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as... |
| disable_formatting | boolean | false | - | When enabled, the gateway will not append the "Sources" footer to assistant responses that used Linkup web search. Useful when the model output is piped to another... |
Supports interleaved thinking, function calling, and implicit prompt cache reads. Thinking is always on and billed as output tokens.
On EmpirioLabs, MiniMax M2.7 is billed pay as you go: Input $0.15 (was $0.30) per 1M prompt tokens; Output $0.60 (was $1.20) per 1M generated tokens; Implicit cache read $0.03 (was $0.06) per 1M cached input tokens. The live rate card on this page always matches what the API charges.
MiniMax M2.7 supports a 200K-token context window with up to 32,768 output tokens per response.
Yes. MiniMax M2.7 serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to minimax-m2-7.
Yes. The EmpirioLabs playground runs MiniMax M2.7 in the browser with the same parameters the API exposes, so you can test prompts before writing code.
Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.
Explore our models, or contact us about business inquiries, custom deployments, or anything else.