MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer
Abstract
In this work, we propose MagicDance, a diffusion-based model for 2D human motion and facial expression transfer on challenging human dance videos. Specifically, we aim to generate human dance videos of any target identity driven by novel pose sequences while keeping the identity unchanged. To this end, we propose a two-stage training strategy that disentangles human motion from appearance (e.g., facial expressions, skin tone, and clothing): pretraining an appearance-control block, followed by fine-tuning an appearance-pose joint-control block on human dance poses from the same dataset. Our novel design enables robust appearance control with temporally consistent upper-body and facial attributes, and even a consistent background. By leveraging the prior knowledge of image diffusion models, the model also generalizes well to unseen human identities and complex motion sequences without any fine-tuning on additional data with diverse human attributes. Moreover, the proposed model is easy to use and can serve as a plug-in module/extension to Stable Diffusion. We also demonstrate the model's ability for zero-shot 2D animation generation, enabling not only appearance transfer from one identity to another but also cartoon-like stylization given only pose inputs. Extensive experiments demonstrate our superior performance on the TikTok dataset.
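To make the two-stage design concrete, below is a minimal PyTorch sketch of how appearance and pose control blocks could plug into a frozen diffusion UNet, in the spirit of ControlNet-style side branches. This is an illustration under stated assumptions, not the authors' released code: all names (`ToyUNet`, `ControlBranch`, `denoise`) are hypothetical, the toy UNet stands in for Stable Diffusion's actual backbone, and the exact injection points and pose representation in MagicDance may differ.

```python
import torch
import torch.nn as nn


class ToyUNet(nn.Module):
    """Stand-in for the frozen Stable Diffusion UNet; its weights stay fixed."""

    def __init__(self, latent_channels: int = 4, feat_dim: int = 320):
        super().__init__()
        self.inp = nn.Conv2d(latent_channels, feat_dim, 3, padding=1)
        self.out = nn.Conv2d(feat_dim, latent_channels, 3, padding=1)

    def forward(self, latent, control=None):
        h = self.inp(latent)
        if control is not None:
            h = h + control  # inject the control residual into the frozen backbone
        return self.out(h)


class ControlBranch(nn.Module):
    """ControlNet-style side branch whose output is added to UNet features.
    The zero-initialized projection makes training start from the frozen prior."""

    def __init__(self, cond_channels: int, feat_dim: int = 320):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(cond_channels, 64, 3, padding=1), nn.SiLU(),
            nn.Conv2d(64, feat_dim, 3, padding=1),
        )
        self.zero_proj = nn.Conv2d(feat_dim, feat_dim, 1)
        nn.init.zeros_(self.zero_proj.weight)
        nn.init.zeros_(self.zero_proj.bias)

    def forward(self, cond):
        return self.zero_proj(self.encoder(cond))


def denoise(unet, appearance_block, pose_block, noisy_latent, ref_image, pose_map, stage):
    """Stage 1 pretrains the appearance-control block alone; stage 2 fine-tunes
    the appearance-pose joint control on dance poses from the same dataset."""
    residual = appearance_block(ref_image)
    if stage == 2:
        residual = residual + pose_block(pose_map)
    return unet(noisy_latent, control=residual)


if __name__ == "__main__":
    unet = ToyUNet().requires_grad_(False)       # frozen image-diffusion prior
    appearance = ControlBranch(cond_channels=3)  # trained in stage 1
    pose = ControlBranch(cond_channels=3)        # added and tuned in stage 2

    # Toy inputs at latent resolution; the real model encodes images through a
    # VAE and conditions on rendered pose maps, so these shapes are illustrative.
    noisy_latent = torch.randn(1, 4, 32, 32)
    ref_image = torch.randn(1, 3, 32, 32)
    pose_map = torch.randn(1, 3, 32, 32)

    stage1_pred = denoise(unet, appearance, pose, noisy_latent, ref_image, pose_map, stage=1)
    stage2_pred = denoise(unet, appearance, pose, noisy_latent, ref_image, pose_map, stage=2)
    print(stage1_pred.shape, stage2_pred.shape)  # both torch.Size([1, 4, 32, 32])
```

In this sketch, only the appearance branch would receive gradients in stage 1, while stage 2 optimizes both branches jointly on dance-pose data; the backbone stays frozen throughout, which is consistent with the abstract's claim that the image-diffusion prior is preserved and the whole model acts as a plug-in to Stable Diffusion.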