InfiniteTalk

High-fidelity talking head video generation with precise lip-sync and natural head motion.

GPU Min12GB
GPU Rec24GB
Disk Min20GB
Disk Rec60GB
ComfyUI: 1.37.11
Last Updated: 1/28/2026
Max frames
kjnodes
value
500
negative_prompt
STRING
bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards
positive_prompt
STRING
The man speaking
LoadAudio
-core
AUDIO
hello.mp3
null
null
Width
kjnodes
value
480
Height
kjnodes
value
848
LoadImage
-core
IMAGE
MASK
useflow (1).png
image
Create Video - InfiniteTalk
-core
image
width
height
num_frames
audio
positive_prompt
negative_prompt
images
width
height
num_frames
positive_prompt
negative_prompt
model
lora
model_name
clip_name
VHS_VideoCombine
videohelpersuite
images
audio
meta_batch
vae
Filenames
frame_rate25
loop_count0
filename_prefixuseflow/v
formatvideo/h264-mp4
pix_fmtyuv420p
crf19
save_metadatatrue
trim_to_audiofalse
pingpongfalse
save_outputtrue
Readme

Useflow - Simple workflows that actually work

Model links

GGUF

Wan2.1-i2v 480p or 720p

InfiniteTalk

LORA

clip_vision

text_encoders

vae

wav2vec2

MelBandRoFormer

Model Storage Location

šŸ“‚ ComfyUI/
ā”œā”€ā”€ šŸ“‚ models/
│   ā”œā”€ā”€ šŸ“‚ diffusion_models/
│   │      ā”œā”€ā”€ wan2.1-i2v-14b-480p_xxxx.gguf || wan2.1-i2v-14b-720p_xxxx.gguf
│   │      ā”œā”€ā”€ Wan2_1-InfiniteTalk_Single_xxxx.gguf
│   │      └── MelBandRoformer_fp16.safetensors
│   ā”œā”€ā”€ šŸ“‚ loras/
│   │      └── lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors
│   ā”œā”€ā”€ šŸ“‚ clip_vision/
│   │      └── clip_vision_h.safetensors
│   ā”œā”€ā”€ šŸ“‚ text_encoders/
│   │      └── umt5-xxl-enc-bf16.safetensors
│   ā”œā”€ā”€ šŸ“‚ vae/
│   │      └── Wan2_1_VAE_bf16.safetensors
│   └── šŸ“‚ wav2vec2/
│          └── wav2vec2-chinese-base_fp16.safetensors
Zoom: 100%
workflow

Required Assets

GGUF

wan2.1-i2v-14b-480p_xxxx.gguf

Target: ComfyUI/models

File: wan2.1-i2v-14b-480p_xxxx.gguf

Download →

wan2.1-i2v-14b-720p_xxxx.gguf

Target: ComfyUI/models

File: wan2.1-i2v-14b-720p_xxxx.gguf

Download →

Diffusion Models

Wan2_1-InfiniteTalk_Single_xxxx.gguf

Target: ComfyUI/models/diffusion_models

File: Wan2_1-InfiniteTalk_Single_xxxx.gguf

Download →

MelBandRoformer_fp16.safetensors

Target: ComfyUI/models/diffusion_models

File: MelBandRoformer_fp16.safetensors

Download →

Loras

lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors

Target: ComfyUI/models/loras

File: lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors

Download →

Clip Vision

clip_vision_h.safetensors

Target: ComfyUI/models/clip_vision

File: clip_vision_h.safetensors

Download →

Text Encoders

umt5-xxl-enc-bf16.safetensors

Target: ComfyUI/models/text_encoders

File: umt5-xxl-enc-bf16.safetensors

Download →

Vae

Wan2_1_VAE_bf16.safetensors

Target: ComfyUI/models/vae

File: Wan2_1_VAE_bf16.safetensors

Download →

Wav2vec2

wav2vec2-chinese-base_fp16.safetensors

Target: ComfyUI/models

File: wav2vec2-chinese-base_fp16.safetensors

Download →

Workflow File

Download the workflow JSON to import into ComfyUI.

Download workflow.json