오디오 생성 Model APIs: Pricing, Playground & Docs

About 오디오 생성 on EmpirioLabs

Audio generation models on EmpirioLabs cover text-to-speech, music, and sound effects, with options like multiple speakers or voice cloning depending on the model. You send your text or prompt and retrieve audio, billed per generation or per minute.

오디오 생성 models (10)

How to call 오디오 생성 models

ACE-Step 1.5 XL runs through POST /v1/audio/generations. The request returns a job_id right away; poll GET /v1/jobs/{job_id} until the job completes and read the output URLs from the result. Swap the model id for any model above. Get an API key from the EmpirioLabs dashboard.

cURL: submit the job

curl https://api.empiriolabs.ai/v1/audio/generations \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ace-step-1.5-xl",
    "prompt": "Describe what you want ACE-Step 1.5 XL to generate."
  }'

오디오 생성 APIs: common questions

How many 오디오 생성 model APIs does EmpirioLabs offer?

EmpirioLabs lists 10 오디오 생성 models, including ACE-Step 1.5 XL, TTS 1.5 Mini, TTS 1.5 Max. Each model has its own dedicated API page with live pricing, parameters, and a quickstart.

How are 오디오 생성 APIs priced on EmpirioLabs?

Every 오디오 생성 model is billed pay as you go, with no monthly minimum. The exact rate card lives on each model's page and always matches what the API charges.

Do I need to be a developer to use 오디오 생성 models?

No. Every model here runs in the EmpirioLabs playground, a friendly in-browser interface where you can set the options and see results without writing any code. When you are ready to automate, the same model is available through the API.

Other model categories

텍스트 생성55 이미지 생성7 영상 생성17 이름 *3 연구 및 검색13 3D 세대1 관련 상품3 채용 정보1 도구 및 에이전트2

전체 모델 카탈로그 보기

오디오 생성 APIs

About 오디오 생성 on EmpirioLabs

오디오 생성 models (10)

ACE-Step 1.5 XL

TTS 1.5 Mini

TTS 1.5 Max

Gemini 2.5 Flash TTS

Gemini 2.5 Pro TTS

Gemini 3.1 Flash TTS

GLM TTS

SoulX Podcast

Stable Audio 2.0

Stable Audio 2.5