Generation

T2V — Text-to-Video

Resolution
0.5 5.1

⌚ ZeroGPU reservation: ~110s

Send to:

I2V — Image-to-Video

Resolution
0.5 5.1

⌚ ZeroGPU reservation: ~120s

Send to:

TI2V — Text+Image to Video (Wan 2.2-5B, via upstream wan package)

TI2V-5B is locked to 1280×704 (landscape) or 704×1280 (portrait), 121 frames @ 24 fps.

Orientation

⌚ ZeroGPU reservation: ~60s

Send to:

FLF2V — First-Last-Frame to Video (Wan 2.1 only)

⌚ ZeroGPU reservation: ~150s

Send to:

V2V — Video-to-Video Restyle

0.1 1

⌚ ZeroGPU reservation: ~90s

Send to:

VACE — Versatile Animation Control & Editing (Wan 2.1 only)

Sub-mode
Mask source

⌚ ZeroGPU reservation: ~180s

Send to:

S2V — Speech to Video (Wan 2.2, via upstream wan package)

Resolution

Duration: auto (driven by audio length)

⌚ ZeroGPU reservation: ~240s (audio-driven)

Send to:

Animate — Character Animation & Replacement (Wan 2.2)

Mode
Resolution
1 20

⚠ Pose+face preprocessing runs on CPU before GPU (~30s extra).

⌚ ZeroGPU reservation: ~300s (xlarge tier)

Send to:

Settings — Model Manager

Model load status appears here.