Home Blog

How to Use the Kling 3.0 Turbo API

Kling 3.0 Turbo via API on EmpirioLabs

Jun 18, 2026

EmpirioLabs AI

Kling 3.0 Turbo is an image-to-video model from Kling AI, now live on EmpirioLabs. Give it one source image and a short prompt describing the motion, and it animates the image into a smooth video with synchronized native audio. It renders at 720p or 1080p for 3 to 15 seconds, and it is built for speed, so clips come back quickly.

It is image-to-video only: an image is required on every request, and a text prompt is optional but recommended to direct the motion. Native audio is generated automatically, so the output already has sound.

Pricing

Kling 3.0 Turbo is pay-as-you-go, billed per second of video generated, with 1080p priced higher per second than 720p. There is no subscription and no minimum, and a request that fails before any video is produced is not billed. The live per-second rates are on the Kling 3.0 Turbo model page and the pricing page.

Quickstart

The API is OpenAI-compatible. Submit a generation, then poll the returned job until it is ready. Set your key as EMPIRIOLABS_API_KEY and call the videos endpoint:

curl https://api.empiriolabs.ai/v1/videos/generations \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kling-3-0-turbo",
    "image": "https://your-host.com/source.jpg",
    "prompt": "cinematic slow push-in, gentle wind, drifting clouds",
    "resolution": "720p",
    "duration": 5
  }'

The response returns a job_id and a poll_url. Poll it until the status is completed:

curl https://api.empiriolabs.ai/v1/jobs/JOB_ID \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY"

When it completes, the response includes a signed video URL you can download. The source image can be passed as a public URL or as base64, in JPG or PNG.

Parameters

  • image (required): the source image to animate.
  • prompt: describes the motion to apply.
  • resolution: 720p or 1080p.
  • duration: video length in seconds, 3 to 15.

Good to know

  • Image-to-video only, so an image is always required. Use a real photo or rendered frame; logos or graphics with transparent backgrounds may be rejected.
  • Native audio is always on, so the clip arrives with sound.
  • Generation is asynchronous: submit, then poll the job. Typical clips finish in well under a minute.
  • You only pay for video that is produced; failed requests are not billed.

Multi-shot prompts

Kling 3.0 Turbo can render up to 6 sequential shots in a single video. Format the prompt as shot 1, <seconds>, <description>; shot 2, <seconds>, <description>; where each shot lasts at least 1 second and the shot durations add up to the total duration. For example, a 6 second video: shot 1, 3, a lone astronaut walks across a red desert at dawn; shot 2, 3, the camera tilts up to reveal two pale moons;

Try it in the playground, read the API docs, or open the model page.

Ready to use better endpoints?

Explore our models, or contact us about business inquiries, custom deployments, or anything else.