audio

Whisper Transcription

Transcribe speech to text with timestamps and SRT output.

1 credits per generation

Try Whisper Transcription

Generating withWhisper Transcription1c per generation

Created with Whisper Transcription

Features

Timestamps
SRT output
100+ languages
Speaker detection

Specifications

Languages: 100+
Output: Text + SRT

Input Requirements

Audio/Video*

audio upload

Language(optional)

select

Pricing

1 credits

~$0.03 per generation

Related Models

ElevenLabs TTS

100+ voices, natural TTS

from 1 credits · $0.02+

MiniMax Speech 2.8 HD

HD expressive text-to-speech

from 1 credits · $0.02+

MiniMax Speech 2.8 Turbo

Fast, affordable text-to-speech

from 1 credits · $0.012+

ElevenLabs Sound Effects

AI sound effects from text

1 credits · $0.03+

Stable Audio

AI music generation

1 credits · $0.05+

ElevenLabs Voice Clone

Clone a voice from one sample

15 credits · $3.00

ElevenLabs Translate

AI dubbing to 10 languages

from 9 credits · $1.80-$18.00

ElevenLabs Audio Isolation

Vocal isolation & denoising

from 1 credits · $0.20-$2.00

ElevenLabs Voice Convert

Voice-to-voice transform

from 3 credits · $0.60-$6.00

MiniMax Voice Design

Custom voices from text prompt

29 credits · $6.00

Seed Audio 1.0

Prompt-driven speech + sound scenes

1 credits · $0.03+

Frequently Asked Questions

How much does Whisper Transcription cost?

Whisper Transcription costs 1 credits per generation (~$0.03). You get 10 free credits every day to try it.

Can I use Whisper Transcription outputs commercially?

Yes, all content generated with Whisper Transcription on Arteza comes with a commercial license.

What file format does Whisper Transcription output?

High-quality PNG images at your chosen resolution.

More AI Tools

Image Generator Video Generator AI Upscaler Background Remover Inpainting Outpainting Audio Studio Pricing