DeepSeek V3.2 API

Open-source Mixture-of-Experts LLM tuned for high-efficiency reasoning, coding, and general language tasks across long-form prompts.

DeepSeekText Generation128K contextSingaporeProprietary Endpoint

About DeepSeek V3.2

Open-source Mixture-of-Experts LLM tuned for high-efficiency reasoning, coding, and general language tasks across long-form prompts.

reasoning

DeepSeek V3.2 specs

Model ID
deepseek-v3-2
Provider
DeepSeek
Category
Text Generation
Context window
128K tokens
Max output
32,768 tokens
Input
text
Output
text
Region
Singapore
Endpoints
POST /v1/chat/completions
POST /v1/responses
POST /v1/messages

DeepSeek V3.2 API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type
Spec
Rate
Input
per 1M prompt tokens
$0.57
Output
per 1M generated tokens
$1.71
Web Search
per call
$0.015
Compare on the full pricing page

How to call the DeepSeek V3.2 API

DeepSeek V3.2 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id deepseek-v3-2. Get an API key from the EmpirioLabs dashboard.

cURL
curl https://api.empiriolabs.ai/v1/chat/completions \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3-2",
    "messages": [
      {"role": "user", "content": "Write a haiku about the ocean."}
    ]
  }'
Python (OpenAI SDK)
from openai import OpenAI

client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key="YOUR_EMPIRIOLABS_API_KEY",
)

response = client.chat.completions.create(
    model="deepseek-v3-2",
    messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)
Full DeepSeek V3.2 API reference

DeepSeek V3.2 API parameters

Request parameters supported by the DeepSeek V3.2 API on EmpirioLabs. Defaults apply when a field is omitted.

ParameterTypeDefaultRange / valuesDescription
temperaturenumber0.70 to 2Sampling temperature
top_pnumber0.90 to 1Nucleus sampling
max_tokensnumber40961 to 65536Max output tokens
enable_thinkingbooleantrue-Enable step-by-step reasoning before answering.
thinking_budgetnumber327681 to 393216Maximum tokens reserved for the reasoning process. Up to 393216.
reasoning_effortenummediumnone, low, medium, high, maxReasoning effort level. none disables thinking. low, medium, high, and max set bounded thinking budgets sized to the selected model. Sent as an OpenAI-style...
enable_searchbooleanfalse-Allow real-time web search. Billed only when the provider reports search usage.

Good to know

Web search calls cost $0.015 each — only billed when invoked. Reasoning tokens (CoT) bill as output tokens.

DeepSeek V3.2 API: common questions

How much does the DeepSeek V3.2 API cost?

On EmpirioLabs, DeepSeek V3.2 is billed pay as you go: Input $0.57 per 1M prompt tokens; Output $1.71 per 1M generated tokens; Web Search $0.015 per call. The live rate card on this page always matches what the API charges.

What is the context window of DeepSeek V3.2?

DeepSeek V3.2 supports a 128K-token context window with up to 32,768 output tokens per response.

Is the DeepSeek V3.2 API OpenAI-compatible?

Yes. DeepSeek V3.2 serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to deepseek-v3-2.

Can I try DeepSeek V3.2 in the browser before integrating?

Yes. The EmpirioLabs playground runs DeepSeek V3.2 in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a DeepSeek V3.2 API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Ready to use better endpoints?

Explore our models, or contact us about business inquiries, custom deployments, or anything else.