⌚ ZeroGPU reservation: ~110s
⌚ ZeroGPU reservation: ~120s
TI2V — Text+Image to Video (Wan 2.2-5B, via upstream wan package)
TI2V-5B is locked to 1280×704 (landscape) or 704×1280 (portrait), 121 frames @ 24 fps.
⌚ ZeroGPU reservation: ~60s
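The fixed TI2V-5B output settings above can be written down as a small lookup, which also makes the clip length explicit. This is only a sketch: the constant and function names (`TI2V_SIZES`, `ti2v_settings`) are hypothetical and not taken from the app or the upstream wan package; only the numbers come from the note above.

```python
# Values copied from the TI2V note; all names here are illustrative only.
TI2V_SIZES = {"landscape": (1280, 704), "portrait": (704, 1280)}
TI2V_FRAMES = 121
TI2V_FPS = 24


def ti2v_settings(orientation: str) -> dict:
    """Return the fixed TI2V-5B output settings for an orientation."""
    if orientation not in TI2V_SIZES:
        raise ValueError(f"orientation must be one of {sorted(TI2V_SIZES)}")
    width, height = TI2V_SIZES[orientation]
    return {
        "width": width,
        "height": height,
        "num_frames": TI2V_FRAMES,
        "fps": TI2V_FPS,
        # Clip length implied by the frame count and frame rate.
        "duration_s": TI2V_FRAMES / TI2V_FPS,
    }
```

At 121 frames and 24 fps, either orientation yields a clip of roughly 5.04 seconds.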
FLF2V — First-Last-Frame to Video (Wan 2.1 only)
⌚ ZeroGPU reservation: ~150s
V2V — Video-to-Video Restyle
⌚ ZeroGPU reservation: ~90s
VACE — Versatile Animation Control & Editing (Wan 2.1 only)
⌚ ZeroGPU reservation: ~180s
S2V — Speech to Video (Wan 2.2, via upstream wan package)
Duration: auto (driven by audio length)
⌚ ZeroGPU reservation: ~240s (audio-driven)
Animate — Character Animation & Replacement (Wan 2.2)
⚠ Pose and face preprocessing runs on the CPU before the GPU stage (adds ~30s).
⌚ ZeroGPU reservation: ~300s (xlarge tier)
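For a quick comparison, the approximate reservations listed for the named modes above can be collected into one table. This is a rough sketch under assumptions: the names `RESERVATION_S` and `time_budget_s` are hypothetical, the figures are the approximate values shown above rather than measured run times, and adding the Animate CPU preprocessing on top of its reservation is an interpretation of the "~30s extra" note.

```python
# Approximate ZeroGPU reservations in seconds, as listed per mode.
RESERVATION_S = {
    "TI2V": 60,
    "FLF2V": 150,
    "V2V": 90,
    "VACE": 180,
    "S2V": 240,      # audio-driven, so the real figure varies with the clip
    "Animate": 300,  # xlarge tier
}

# Assumption: the CPU pose and face preprocessing is extra time before the GPU stage.
ANIMATE_CPU_PREPROCESS_S = 30


def time_budget_s(mode: str) -> int:
    """Rough upper-bound time budget for a mode, in seconds."""
    if mode not in RESERVATION_S:
        raise KeyError(f"unknown mode: {mode!r}")
    extra = ANIMATE_CPU_PREPROCESS_S if mode == "Animate" else 0
    return RESERVATION_S[mode] + extra
```

Under these assumptions, Animate has the largest budget (about 330 seconds including preprocessing) and TI2V the smallest (about 60 seconds).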
Gallery — session history
Generated videos will appear here.
Model load status appears here.