Stable Audio 2.0 API

Generates audio up to 3 minutes from text prompts, supporting text-to-audio and audio-to-audio with adjustable duration, steps, and CFG scale.

Stability AIAudio GenerationProprietary Endpoint

About Stable Audio 2.0

Generates audio up to 3 minutes from text prompts, supporting text-to-audio and audio-to-audio with adjustable duration, steps, and CFG scale.

Also known as Stable Audio, Stable-Audio-2.0

Stable Audio 2.0 specs

Model ID
stable-audio-2-0
Provider
Stability AI
Category
Audio Generation
Input
text
Output
audio
Endpoints
POST /v1/audio/generations

Stable Audio 2.0 API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type
Spec
Rate
Base Cost
per generation
$0.58
Per Step Cost
per step
$0.00
Compare on the full pricing page

How to call the Stable Audio 2.0 API

Stable Audio 2.0 runs through POST /v1/audio/generations. The request returns a job_id right away; poll GET /v1/jobs/{job_id} until the job completes and read the output URLs from the result. Get an API key from the EmpirioLabs dashboard.

cURL: submit the job
curl https://api.empiriolabs.ai/v1/audio/generations \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "stable-audio-2-0",
    "prompt": "Describe what you want Stable Audio 2.0 to generate."
  }'
cURL: poll for the result
curl https://api.empiriolabs.ai/v1/jobs/JOB_ID \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY"
Python
import requests

response = requests.post(
    "https://api.empiriolabs.ai/v1/audio/generations",
    headers={"Authorization": "Bearer YOUR_EMPIRIOLABS_API_KEY"},
    json={
        "model": "stable-audio-2-0",
        "prompt": "Describe what you want Stable Audio 2.0 to generate.",
    },
)
job = response.json()

# Generation runs as an async job. Poll until it completes.
import time
while True:
    status = requests.get(
        f"https://api.empiriolabs.ai/v1/jobs/{job['job_id']}",
        headers={"Authorization": "Bearer YOUR_EMPIRIOLABS_API_KEY"},
    ).json()
    if status.get("status") in ("completed", "failed"):
        print(status)
        break
    time.sleep(5)
Full Stable Audio 2.0 API reference

Stable Audio 2.0 API parameters

Request parameters supported by the Stable Audio 2.0 API on EmpirioLabs. Defaults apply when a field is omitted.

ParameterTypeDefaultRange / valuesDescription
promptstring--What to generate. Be specific about genre, instruments, mood, and tempo.
modeenumtext-to-audiotext-to-audio, audio-to-audiotext-to-audio: generate from prompt only. audio-to-audio: condition on a reference clip.
output_formatenummp3mp3, wavOutput media file format (mp3, wav, mp4, png, jpg, etc., depending on the endpoint).
durationnumber1901 to 190Seconds. Stability Audio 2.0 generates up to 3 minutes 10 seconds.
stepsnumber5030 to 100Diffusion steps. More = higher fidelity, slower (and adds per-step credits).
cfg_scalenumber71 to 25Classifier-free guidance. Higher = follows prompt more strictly.
strengthnumber10 to 1Audio-to-audio only. 0 = ignore reference, 1 = stay close to reference.
random_seedbooleantrue-If true, use a random seed each call.
seednumber--Reproducibility seed. Only used when random_seed=false.
audio_urlstring--Reference audio URL for audio-to-audio mode.

Good to know

Generates up to 3 minutes of audio from text or via audio-to-audio transformation.

Audio-to-audio mode

  • Requires BOTH a prompt and an uploaded audio file
  • Recommended CFG scale: 7-15
  • Recommended steps: 6-8
  • Typical strength: 0.3-0.7

Stable Audio 2.0 API: common questions

How much does the Stable Audio 2.0 API cost?

On EmpirioLabs, Stable Audio 2.0 is billed pay as you go: Base Cost $0.58 per generation; Per Step Cost $0.00 per step. The live rate card on this page always matches what the API charges.

Which endpoint does Stable Audio 2.0 use?

Stable Audio 2.0 is served through POST /v1/audio/generations on api.empiriolabs.ai with standard bearer-token authentication.

Can I try Stable Audio 2.0 in the browser before integrating?

Yes. The EmpirioLabs playground runs Stable Audio 2.0 in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a Stable Audio 2.0 API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Ready to use better endpoints?

Explore our models, or contact us about business inquiries, custom deployments, or anything else.