audio

Seed Audio 1.0

ByteDance Seed Audio 1.0. Prompt-driven speech and sound-scene generation: describe the dialogue, narration, or ambience and Seed Audio renders expressive audio. Optional steering with a single reference image OR up to three reference-audio clips (never both). Reuse your own cloned voices for a consistent speaker. English and Chinese, up to two minutes per clip, billed on the real output length.

1 credits per generation

Try Seed Audio 1.0

Generating withSeed Audio 1.01c per generation

Created with Seed Audio 1.0

Features

  • Prompt-driven scenes
  • Image or audio steering
  • English and Chinese
  • Reuse cloned voices
  • Speed, volume and pitch control
  • Up to 2 minutes

Specifications

Languages
English, Chinese
Max Length
2 minutes
Steering
Image or reference audio
Input
Prompt + optional image / reference audio
Output
MP3 audio

Input Requirements

Prompt*
textarea
Voice(optional)
select
Speed(optional)
slider
Volume(optional)
slider
Pitch(optional)
slider

Pricing

1 credits
~$0.03+ per generation

Related Models

Frequently Asked Questions

How much does Seed Audio 1.0 cost?

Seed Audio 1.0 costs 1 credits per generation (~$0.03+). You get 10 free credits every day to try it.

Can I use Seed Audio 1.0 outputs commercially?

Yes, all content generated with Seed Audio 1.0 on Arteza comes with a commercial license.

What file format does Seed Audio 1.0 output?

High-quality PNG images at your chosen resolution.