avatar

OmniHuman v1.5

Create hyper-realistic talking avatars from a single portrait photo and audio file. Features perfect lip sync, natural facial expressions, and gesture generation synchronized to speech rhythm and emotion.

2 credits per generation

Try OmniHuman v1.5

Generating withOmniHuman v1.52c per generation

Created with OmniHuman v1.5

Features

Single Photo Input
Perfect Lip Sync
Natural Expressions
Gesture Generation
Turbo Mode
720p/1080p Output

Specifications

Resolution: 720p or 1080p
Input: Portrait photo + Audio file
Audio Limit: 30s
Output: MP4 Video

Input Requirements

Portrait Photo*

image upload

Clear, front-facing portrait photo

Audio File*

audio upload

Speech audio to sync (max 60s at 720p, 30s at 1080p)

Scene Description (optional)(optional)

textarea

Turbo Mode(optional)

checkbox

Resolution(optional)

select

Pricing

from 2 credits

~$0.32-$9.60 per generation

Related Models

Kling Avatar v2

Versatile lip sync for any character

from 2 credits · $0.23-$13.80

SadTalker

Budget avatar from photo + audio

5 credits · $1.00

Sync-3 Lipsync

Video dubbing with 4K lip sync

from 2 credits · $0.27-$16.01

Hunyuan Avatar

Talking and singing, up to 120s

Fabric 1.0

Photo + audio talking avatar

from 1 credits · $0.16/s+

Infini Talk

Audio-driven talking avatar

from 4 credits · $0.40/s+

Wan 2.2 S2V

Speech-to-video from photo + audio

from 3 credits · $0.50-$3.00

Frequently Asked Questions

How much does OmniHuman v1.5 cost?

OmniHuman v1.5 costs 2 credits per generation (~$0.32-$9.60). You get 10 free credits every day to try it.

Can I use OmniHuman v1.5 outputs commercially?

Yes, all content generated with OmniHuman v1.5 on Arteza comes with a commercial license.

What file format does OmniHuman v1.5 output?

MP4 video files with lip-synced audio.

More AI Tools

Image Generator Video Generator AI Upscaler Background Remover Inpainting Outpainting Audio Studio Pricing