avatarNEW
IT
Infini Talk
Audio-driven talking avatar with expressive motion from a single image. Output length is controlled by frames at 25 fps, up to roughly 28 seconds.
Try Infini Talk
Generating withInfini Talk4c per generation
Created with Infini Talk
Features
- Photo + Audio Input
- Expressive Motion
- 480p / 720p
- Frame-Bounded Output
Specifications
- Resolution
- 480p or 720p
- Input
- Photo + Audio
- Audio Limit
- 28s
- Output
- MP4 Video
Input Requirements
Portrait Photo*
image upload
Front-facing portrait photo
Audio File*
audio upload
Speech audio to sync (max 28s)
Scene Description*
textarea
Resolution(optional)
select
Acceleration(optional)
select
Seed(optional)
number
Related Models
OmniHuman v1.5
Photo + Audio to talking avatar
from 2 credits · $0.32-$9.60
Kling Avatar v2
Versatile lip sync for any character
from 2 credits · $0.23-$13.80
SadTalker
Budget avatar from photo + audio
5 credits · $1.00
Sync-3 Lipsync
Video dubbing with 4K lip sync
from 2 credits · $0.27-$16.01
Hunyuan Avatar
Talking and singing, up to 120s
Fabric 1.0
Photo + audio talking avatar
from 1 credits · $0.16/s+
Wan 2.2 S2V
Speech-to-video from photo + audio
from 3 credits · $0.50-$3.00
Frequently Asked Questions
How much does Infini Talk cost?
Infini Talk costs 4 credits per generation (~$0.40/s+). You get 10 free credits every day to try it.
Can I use Infini Talk outputs commercially?
Yes, all content generated with Infini Talk on Arteza comes with a commercial license.
What file format does Infini Talk output?
MP4 video files with lip-synced audio.