
Hybrid model unifying Instruct, Reasoning (Magistral), and Devstral families: 40% lower completion time and 3x throughput vs Small 3.
Model id: mistral-small-4
Endpoints: POST /v1/chat/completions, POST /v1/responses, POST /v1/messages

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.
Mistral Small 4 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id mistral-small-4. Get an API key from the EmpirioLabs dashboard.
curl:

    curl https://api.empiriolabs.ai/v1/chat/completions \
      -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "mistral-small-4",
        "messages": [
          {"role": "user", "content": "Write a haiku about the ocean."}
        ]
      }'

Python (OpenAI SDK):

    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.empiriolabs.ai/v1",
        api_key="YOUR_EMPIRIOLABS_API_KEY",
    )
    response = client.chat.completions.create(
        model="mistral-small-4",
        messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
    )
    print(response.choices[0].message.content)

Request parameters supported by the Mistral Small 4 API on EmpirioLabs. Defaults apply when a field is omitted.
| Parameter | Type | Default | Range / values | Description |
|---|---|---|---|---|
| reasoning_enabled | boolean | true | - | Enable extended reasoning (maps to reasoning: high or none) |
| tool_web_search | boolean | true | - | Enable web_search tool |
| web_search_tier | enum | standard | standard, premium | Standard or Premium web-search tier. Premium uses higher-quality sources. |
| tool_code_interpreter | boolean | true | - | Allow the model to execute Python code in a sandbox to compute / analyze data. |
| tool_image_generation | boolean | true | - | Allow the model to generate images inline via the platform image-gen tool. |
| temperature | number | 0.7 | 0 to 1.5 | Sampling temperature. 0 = deterministic; higher values increase randomness, up to the 1.5 maximum. |
| max_tokens | number | 4096 | 1 to 32768 | Maximum tokens in the response. |
| disable_formatting | boolean | false | - | Skip the EmpirioLabs Markdown formatting (citation [[N]](url) rewriting + References block when the web_search tool was used). The raw upstream answer with plain [N]... |
Tools (web search, code interpreter, image generation) are billed only when actually invoked.
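The table above can be turned into a small client-side check before a request is sent. The following is a minimal sketch, not part of the EmpirioLabs API: `DEFAULTS`, `WEB_SEARCH_TIERS`, and `build_params` are hypothetical names, and the defaults and ranges are copied from the table as it stands.

```python
from typing import Any

# Defaults and ranges copied from the parameter table above.
# These names are illustrative; they are not part of any SDK.
DEFAULTS: dict[str, Any] = {
    "reasoning_enabled": True,
    "tool_web_search": True,
    "web_search_tier": "standard",
    "tool_code_interpreter": True,
    "tool_image_generation": True,
    "temperature": 0.7,
    "max_tokens": 4096,
    "disable_formatting": False,
}
WEB_SEARCH_TIERS = {"standard", "premium"}


def build_params(**overrides: Any) -> dict[str, Any]:
    """Merge overrides into the documented defaults and validate ranges."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameter(s): {sorted(unknown)}")
    params = {**DEFAULTS, **overrides}
    if not 0 <= params["temperature"] <= 1.5:
        raise ValueError("temperature must be between 0 and 1.5")
    if not 1 <= params["max_tokens"] <= 32768:
        raise ValueError("max_tokens must be between 1 and 32768")
    if params["web_search_tier"] not in WEB_SEARCH_TIERS:
        raise ValueError("web_search_tier must be 'standard' or 'premium'")
    return params


# Example: a low-temperature completion with every tool switched off,
# so no tool charges can be incurred.
params = build_params(
    temperature=0.2,
    tool_web_search=False,
    tool_code_interpreter=False,
    tool_image_generation=False,
)
```

With the OpenAI Python SDK, `temperature` and `max_tokens` are regular arguments to `client.chat.completions.create`. The platform-specific fields would presumably be sent through the SDK's `extra_body` argument, assuming EmpirioLabs reads them as top-level JSON fields; the page does not confirm this, so check it against the API reference.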
On EmpirioLabs, Mistral Small 4 is billed pay as you go: Input $0.15 per 1M prompt tokens; Output $0.60 per 1M generated tokens; Standard Web Search $0.084 per call. The live rate card on this page always matches what the API charges.
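As a rough illustration of how these rates combine, here is a minimal cost estimator. The function and constant names are hypothetical, the rates are the ones quoted above, and only the Standard web-search price is covered because no Premium price is listed here.

```python
# Rates quoted above, in USD. Check the live rate card before relying on them.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60
STANDARD_WEB_SEARCH_USD_PER_CALL = 0.084


def estimate_cost_usd(
    prompt_tokens: int,
    completion_tokens: int,
    web_search_calls: int = 0,
) -> float:
    """Estimate the charge for one request from its token and tool usage."""
    return (
        prompt_tokens / 1_000_000 * INPUT_USD_PER_MILLION
        + completion_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
        + web_search_calls * STANDARD_WEB_SEARCH_USD_PER_CALL
    )


# 12,000 prompt tokens, 800 generated tokens, one Standard web search.
cost = estimate_cost_usd(12_000, 800, web_search_calls=1)
print(f"${cost:.4f}")  # prints $0.0863
```

In this example the single web-search call ($0.084) costs far more than the tokens ($0.00228), so disabling `tool_web_search` when it is not needed is the main lever on per-request cost.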
Mistral Small 4 supports a 256K-token context window with up to 65,536 output tokens per response.
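These limits suggest a simple budget check before sending a long prompt. The sketch below rests on two assumptions this page does not state: that "256K" means 256,000 tokens (it could mean 262,144), and that prompt and output tokens share the same window. `max_tokens_budget` is a hypothetical helper name.

```python
# Assumptions: "256K" is read as 256,000 tokens, and prompt and output
# tokens are counted against the same context window.
CONTEXT_WINDOW_TOKENS = 256_000
MAX_OUTPUT_TOKENS = 65_536


def max_tokens_budget(prompt_tokens: int) -> int:
    """Largest output budget that still fits, or 0 if the prompt is too long."""
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    return max(0, min(remaining, MAX_OUTPUT_TOKENS))


print(max_tokens_budget(1_000))    # short prompt: capped by the output limit
print(max_tokens_budget(250_000))  # long prompt: capped by the remaining window
```

The parameter table lists a `max_tokens` range of 1 to 32768, so the value accepted in a request may be lower than the 65,536 figure used here.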
Mistral Small 4 is OpenAI-compatible: it serves the Chat Completions API, so existing OpenAI SDKs work once base_url points at https://api.empiriolabs.ai/v1 and the model id is set to mistral-small-4.
You can try the model without writing code: the EmpirioLabs playground runs Mistral Small 4 in the browser with the same parameters the API exposes, so you can test prompts first.
To get an API key, create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing uses pay-as-you-go credits, so you pay only for the requests you make.