オーディオ生成 APIs

Speech, music, and sound generation models on a unified API.

10 models

About オーディオ生成 on EmpirioLabs

Audio generation models on EmpirioLabs cover text-to-speech, music, and sound effects, with options like multiple speakers or voice cloning depending on the model. You send your text or prompt and retrieve audio, billed per generation or per minute.

オーディオ生成 models (10)

How to call オーディオ生成 models

ACE-Step 1.5 XL runs through POST /v1/audio/generations. The request returns a job_id right away; poll GET /v1/jobs/{job_id} until the job completes and read the output URLs from the result. Swap the model id for any model above. Get an API key from the EmpirioLabs dashboard.

cURL: submit the job
curl https://api.empiriolabs.ai/v1/audio/generations \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ace-step-1.5-xl",
    "prompt": "Describe what you want ACE-Step 1.5 XL to generate."
  }'

オーディオ生成 APIs: common questions

How many オーディオ生成 model APIs does EmpirioLabs offer?

EmpirioLabs lists 10 オーディオ生成 models, including ACE-Step 1.5 XL, TTS 1.5 Mini, TTS 1.5 Max. Each model has its own dedicated API page with live pricing, parameters, and a quickstart.

How are オーディオ生成 APIs priced on EmpirioLabs?

Every オーディオ生成 model is billed pay as you go, with no monthly minimum. The exact rate card lives on each model's page and always matches what the API charges.

Do I need to be a developer to use オーディオ生成 models?

No. Every model here runs in the EmpirioLabs playground, a friendly in-browser interface where you can set the options and see results without writing any code. When you are ready to automate, the same model is available through the API.

Other model categories

モデルカタログ全体を見る

Ready to use better endpoints?

Check out our pricing or reach out if you want your own model deployed on our stack.