Stable Audio 2.0 API: Pricing, Playground & Docs

About Stable Audio 2.0

Generates audio up to 190 seconds (3 minutes 10 seconds) long from text prompts. Supports text-to-audio and audio-to-audio modes with adjustable duration, steps, and CFG scale.

Also known as Stable Audio, Stability AI Stable Audio 2.0, Stable-Audio-2.0, stable-audio-2-0

Tags: music generation, text to audio, sound effects

Stable Audio 2.0 specs

Model ID: stable-audio-2-0
Provider: Stability AI
Category: Audio Generation
Released: Apr 3, 2024
Input: Text
Output: Audio
Endpoints: POST /v1/audio/generations
Alternate model IDs: stability-audio-2.0, stability/audio-2.0

Stable Audio 2.0 API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type	Spec	Rate
Base Cost	per generation	$0.58
Per Step Cost	per step	$0.00
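
The two rates above can be combined into a rough cost estimate. The sketch below assumes the charge for one generation is the base cost plus the per-step cost multiplied by the number of steps; the page lists the two rates but does not spell out that formula, so treat it as an assumption. The function name `estimate_cost` is hypothetical, and the rates are a snapshot of the table above.

```python
from decimal import Decimal

# Rates copied from the pricing table above. They are a snapshot and may change.
BASE_COST_PER_GENERATION = Decimal("0.58")
PER_STEP_COST = Decimal("0.00")


def estimate_cost(generations: int, steps: int = 50) -> Decimal:
    """Estimate the charge in USD for `generations` calls of `steps` steps each.

    Assumes cost per generation = base cost + per-step cost * steps.
    """
    if generations < 0 or steps < 0:
        raise ValueError("generations and steps must be non-negative")
    per_generation = BASE_COST_PER_GENERATION + PER_STEP_COST * steps
    return per_generation * generations


# Ten generations at the default of 50 steps.
print(estimate_cost(10))
```

Decimal is used instead of float so that small per-step rates add up without rounding drift.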


How to call the Stable Audio 2.0 API

Stable Audio 2.0 runs through POST /v1/audio/generations. The request returns a job_id right away; poll GET /v1/jobs/{job_id} until the job completes and read the output URLs from the result. Get an API key from the EmpirioLabs dashboard.

cURL: submit the job

curl https://api.empiriolabs.ai/v1/audio/generations \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "stable-audio-2-0",
    "prompt": "Describe what you want Stable Audio 2.0 to generate."
  }'

cURL: poll for the result

curl https://api.empiriolabs.ai/v1/jobs/JOB_ID \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY"

Python

import os
import time

import requests

API_BASE = "https://api.empiriolabs.ai/v1"
# Read the key from the environment instead of hard-coding it.
HEADERS = {"Authorization": f"Bearer {os.environ['EMPIRIOLABS_API_KEY']}"}

# Submit the generation job. The response contains a job_id.
response = requests.post(
    f"{API_BASE}/audio/generations",
    headers=HEADERS,
    json={
        "model": "stable-audio-2-0",
        "prompt": "Describe what you want Stable Audio 2.0 to generate.",
    },
    timeout=30,
)
response.raise_for_status()
job = response.json()

# Generation runs as an async job. Poll until it completes or fails.
while True:
    poll = requests.get(f"{API_BASE}/jobs/{job['job_id']}", headers=HEADERS, timeout=30)
    poll.raise_for_status()
    status = poll.json()
    if status.get("status") in ("completed", "failed"):
        print(status)
        break
    time.sleep(5)
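
The loop above polls until the job reaches a terminal state, however long that takes. A bounded variant might look like the sketch below. The helper name `wait_for_job`, its parameters, and the 600-second default timeout are assumptions for illustration, not part of the EmpirioLabs API; only the "completed" and "failed" status values come from the example above. The status lookup is passed in as a callable so the helper can be exercised without network access.

```python
import time
from typing import Callable

# Terminal status values, taken from the polling example above.
TERMINAL_STATES = ("completed", "failed")


def wait_for_job(
    fetch_status: Callable[[], dict],
    interval: float = 5.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_status() every `interval` seconds until the job finishes.

    Raises TimeoutError if the next wait would pass `timeout` seconds.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status()
        if status.get("status") in TERMINAL_STATES:
            return status
        if time.monotonic() + interval > deadline:
            raise TimeoutError(f"job still not finished after {timeout} seconds")
        sleep(interval)
```

In practice `fetch_status` would wrap the GET /v1/jobs/{job_id} request and return the parsed JSON body.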


Stable Audio 2.0 API parameters

Request parameters supported by the Stable Audio 2.0 API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter	Type	Default	Range / values	Description
prompt	string	-	-	What to generate. Be specific about genre, instruments, mood, and tempo.
mode	enum	text-to-audio	text-to-audio, audio-to-audio	text-to-audio: generate from prompt only. audio-to-audio: condition on a reference clip.
output_format	enum	mp3	mp3, wav	Output audio file format.
duration	number	190	1 to 190	Seconds. Stable Audio 2.0 generates up to 3 minutes 10 seconds.
steps	number	50	30 to 100	Diffusion steps. More steps give higher fidelity but run slower and are billed at the per-step rate (currently $0.00 per step).
cfg_scale	number	7	1 to 25	Classifier-free guidance. Higher = follows prompt more strictly.
strength	number	1	0 to 1	Audio-to-audio only. 0 = ignore reference, 1 = stay close to reference.
random_seed	boolean	true	-	If true, use a random seed each call.
seed	number	-	-	Reproducibility seed. Only used when random_seed=false.
audio_url	string	-	-	Reference audio URL for audio-to-audio mode.
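
The ranges in the table can be checked on the client before a request is sent. The sketch below is one way to do that. The helper name `build_payload` is hypothetical, and it assumes each parameter is sent as a top-level JSON field next to `model` and `prompt`, which the request examples on this page do not show explicitly.

```python
# Ranges and allowed values transcribed from the parameter table above.
NUMERIC_RANGES = {
    "duration": (1, 190),
    "steps": (30, 100),
    "cfg_scale": (1, 25),
    "strength": (0, 1),
}
ENUM_VALUES = {
    "mode": {"text-to-audio", "audio-to-audio"},
    "output_format": {"mp3", "wav"},
}
OTHER_FIELDS = {"random_seed", "seed", "audio_url"}


def build_payload(prompt: str, **options) -> dict:
    """Validate options against the documented ranges and return a request body."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt must be a non-empty string")
    for name, value in options.items():
        if name in NUMERIC_RANGES:
            low, high = NUMERIC_RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}, got {value}")
        elif name in ENUM_VALUES:
            if value not in ENUM_VALUES[name]:
                raise ValueError(f"{name} must be one of {sorted(ENUM_VALUES[name])}")
        elif name not in OTHER_FIELDS:
            raise ValueError(f"unknown parameter: {name}")
    if "seed" in options:
        # The table says seed is only used when random_seed is false.
        if options.setdefault("random_seed", False):
            raise ValueError("seed is ignored unless random_seed is false")
    return {"model": "stable-audio-2-0", "prompt": prompt, **options}


payload = build_payload("Slow lo-fi piano, 70 BPM, warm tape hiss", duration=30, seed=42)
```

Passing a seed without `random_seed` switches `random_seed` to false here, a convenience choice of this sketch rather than documented API behavior.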

Good to know

Generates up to 190 seconds (3 minutes 10 seconds) of audio from text or via audio-to-audio transformation.

Audio-to-audio mode

Requires both a prompt and a reference audio clip (passed as audio_url)
Recommended CFG scale: 7-15
Recommended steps: 6-8
Typical strength: 0.3-0.7
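
An audio-to-audio request body built from the notes above could look like the following sketch. The function name `audio_to_audio_payload` and the example URL are hypothetical, and sending these fields at the top level of the JSON body is an assumption. The defaults are picked from inside the recommended CFG and strength ranges, and `steps` is left for the server to choose.

```python
def audio_to_audio_payload(
    prompt: str,
    audio_url: str,
    strength: float = 0.5,
    cfg_scale: float = 7,
) -> dict:
    """Build a request body for audio-to-audio mode.

    Both a prompt and a reference clip are required in this mode.
    """
    if not prompt.strip():
        raise ValueError("audio-to-audio mode requires a prompt")
    if not audio_url.strip():
        raise ValueError("audio-to-audio mode requires audio_url")
    if not 0 <= strength <= 1:
        raise ValueError("strength must be between 0 and 1")
    return {
        "model": "stable-audio-2-0",
        "mode": "audio-to-audio",
        "prompt": prompt,
        "audio_url": audio_url,
        "strength": strength,
        "cfg_scale": cfg_scale,
    }


# Placeholder URL for illustration only.
body = audio_to_audio_payload("Turn this hummed melody into a string quartet",
                              "https://example.com/reference.wav")
```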

Stable Audio 2.0 API: common questions

How much does the Stable Audio 2.0 API cost?

On EmpirioLabs, Stable Audio 2.0 is billed pay as you go: Base Cost $0.58 per generation; Per Step Cost $0.00 per step. The live rate card on this page always matches what the API charges.

Which endpoint does Stable Audio 2.0 use?

Stable Audio 2.0 is served through POST /v1/audio/generations on api.empiriolabs.ai with standard bearer-token authentication.

Can I try Stable Audio 2.0 in the browser before integrating?

Yes. The EmpirioLabs playground runs Stable Audio 2.0 in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a Stable Audio 2.0 API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

More Audio Generation model APIs

ACE-Step 1.5 XL

TTS 2

TTS 1.5 Mini

TTS 1.5 Max

Gemini 2.5 Flash TTS

Gemini 2.5 Pro TTS
