audio
W

Whisper Transcription

Transcribe speech to text with timestamps and SRT output.

3 credits per generation~~10s

Try Whisper Transcription

Generating withWhisper Transcription3c per generation

Created with Whisper Transcription

Features

  • Timestamps
  • SRT output
  • 100+ languages
  • Speaker detection

Specifications

Languages
100+
Output
Text + SRT

Input Requirements

Audio/Video*
audio upload
Language(optional)
select

Pricing

3 credits
~$0.03 per generation

Frequently Asked Questions

How much does Whisper Transcription cost?

Whisper Transcription costs 3 credits per generation (~$0.03). New accounts get 50 free credits to try it.

How long does Whisper Transcription take to generate?

Typical generation time is ~10s. Speed depends on resolution and settings.

Can I use Whisper Transcription outputs commercially?

Yes, all content generated with Whisper Transcription on Arteza comes with a commercial license.

What file format does Whisper Transcription output?

High-quality PNG images at your chosen resolution.