avatar
OmniHuman v1.5
Create hyper-realistic talking avatars from a single portrait photo and audio file. Features perfect lip sync, natural facial expressions, and gesture generation synchronized to speech rhythm and emotion.
Try OmniHuman v1.5
Generating withOmniHuman v1.5960c per generation
Created with OmniHuman v1.5
Features
- Single Photo Input
- Perfect Lip Sync
- Natural Expressions
- Gesture Generation
- Turbo Mode
- 720p/1080p Output
Specifications
- Resolution
- 720p or 1080p
- Input
- Portrait photo + Audio file
- Audio Limit
- 30s at 1080p, 60s at 720p
- Output
- MP4 Video
Input Requirements
Portrait Photo*
image upload
Clear, front-facing portrait photo
Audio File*
audio upload
Speech audio to sync (max 30s at 1080p)
Scene Description (optional)(optional)
textarea
Turbo Mode(optional)
checkbox
Resolution(optional)
select
Related Models
Frequently Asked Questions
How much does OmniHuman v1.5 cost?
OmniHuman v1.5 costs 960 credits per generation (~$9.60). New accounts get 50 free credits to try it.
How long does OmniHuman v1.5 take to generate?
Typical generation time is 60-90s. Speed depends on resolution and settings.
Can I use OmniHuman v1.5 outputs commercially?
Yes, all content generated with OmniHuman v1.5 on Arteza comes with a commercial license.
What file format does OmniHuman v1.5 output?
MP4 video files with lip-synced audio.