F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Fast Real-time Object Detection with High-Res Output
Text-to-Video
Multimodal Image-to-Video
Apply the motion of a video to a portrait
Generate realistic talking heads from an image and audio
High-quality virtual try-on ~ Your cyber fitting room