SoulX Podcast API

开源语音模型,用于长形,多语种播客对话,配有准语言控制(笑声,叹息)和零镜头语音克隆.

Soul AI Lab音频生成发布日期 Oct 29, 2025原生推理

About SoulX Podcast

开源语音模型,用于长形,多语种播客对话,配有准语言控制(笑声,叹息)和零镜头语音克隆.

也称为 Soul AI Lab SoulX Podcast, SoulX-Podcast

voice cloningmulti speakerdialectpodcast

SoulX Podcast specs

模型 ID
soulx-podcast
提供方
Soul AI Lab
类别
音频生成
Released
Oct 29, 2025
输入
TextAudio
输出
Audio
端点
POST /v1/audio/speech

SoulX Podcast API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

类型
规格
价格
基础
每1k个字符
$0.015
对话框
每1k个字符
$0.015
在完整价格页比较

How to call the SoulX Podcast API

SoulX Podcast serves speech through POST /v1/audio/speech and returns playable audio. Send the text to speak as input with the model id soulx-podcast. Get an API key from the EmpirioLabs dashboard.

cURL
curl https://api.empiriolabs.ai/v1/audio/speech \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "soulx-podcast",
    "input": "Welcome to EmpirioLabs. Your build just finished."
  }' \
  --output speech.mp3
Python
import requests

response = requests.post(
    "https://api.empiriolabs.ai/v1/audio/speech",
    headers={"Authorization": "Bearer YOUR_EMPIRIOLABS_API_KEY"},
    json={"model": "soulx-podcast", "input": "Welcome to EmpirioLabs."},
)
with open("speech.mp3", "wb") as f:
    f.write(response.content)
Full SoulX Podcast API reference

SoulX Podcast API parameters

Request parameters supported by the SoulX Podcast API on EmpirioLabs. Defaults apply when a field is omitted.

参数类型默认范围 / 值描述
inputstring--Podcast script. Use [S1] / [S2] / [S3] / [S4] tags or 'Speaker N:' lines for multi-speaker. Paralinguistic tags supported: <|laughter|>, <|sigh|>, <|breath|>, <|cough|>.
voice_modelenumbasebase, dialectbase: English + Mandarin. dialect: adds Sichuan, Henan, and Cantonese.
voice_s1enumarthurarthur, james, lj, xiaomei, zhigang, custom_s1Voice for [S1]. lj = Emma. custom_s1 requires voice_s1_audio_url.
voice_s2enumljarthur, james, lj, xiaomei, zhigang, custom_s2Voice for [S2]. lj = Emma.
voice_s3enumjamesarthur, james, lj, xiaomei, zhigang, custom_s3Voice for [S3].
voice_s4enumxiaomeiarthur, james, lj, xiaomei, zhigang, custom_s4Voice for [S4].
voice_s1_audio_urlstring--Reference audio URL for [S1] custom-voice cloning. Speaker must say the consent phrase aloud.
voice_s2_audio_urlstring--Reference audio URL for [S2] custom-voice cloning.
voice_s3_audio_urlstring--Reference audio URL for [S3] custom-voice cloning.
voice_s4_audio_urlstring--Reference audio URL for [S4] custom-voice cloning.
temperaturenumber0.60.1 to 2Sampling temperature.
top_knumber1001 to 500Top-k sampling cap.
top_pnumber0.90.1 to 1Nucleus sampling.
repetition_penaltynumber1.251 to 2Higher values discourage repeated phrasing.
3 more parameters in the docs

须知

开源语音模型,用于长形,多语种播客对话,配有准语言控制和零镜头语音克隆.

SoulX Podcast API: common questions

How much does the SoulX Podcast API cost?

On EmpirioLabs, SoulX Podcast is billed pay as you go: 基础 $0.015 每1k个字符; 对话框 $0.015 每1k个字符. The live rate card on this page always matches what the API charges.

Which endpoint does SoulX Podcast use?

SoulX Podcast is served through POST /v1/audio/speech on api.empiriolabs.ai with standard bearer-token authentication.

Can I try SoulX Podcast in the browser before integrating?

Yes. The EmpirioLabs playground runs SoulX Podcast in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a SoulX Podcast API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Ready to use better endpoints?

Check out our pricing or reach out if you want your own model deployed on our stack.