MyNiuuu
/

MOFA-Video-Hybrid

Model card Files Files and versions Community

MOFA-Video-Hybrid / README.md

MyNiuuu's picture

Update README.md

d79654c verified 5 months ago

|

1.64 kB

	---
	license: apache-2.0
	---


	## Paper

	arxiv.org/abs/2405.20222


	## Introduction

	This repo provides the inference Gradio demo for Hybrid (Trajectory + Landmark) Control of [MOFA-Video](https://myniuuu.github.io/MOFA_Video/).

	## Environment Setup

	```
	cd MOFA-Hybrid
	conda create -n mofa python==3.10
	conda activate mofa
	pip install -r requirements.txt
	pip install opencv-python-headless
	pip install "git+https://github.com/facebookresearch/pytorch3d.git"
	```

	IMPORTANT: Gradio Version of 4.5.0 should be used since other versions may cause errors.


	## Checkpoints Download
	1. Download the checkpoint of CMP from [here](https://huggingface.co/MyNiuuu/MOFA-Video-Hybrid/blob/main/models/cmp/experiments/semiauto_annot/resnet50_vip%2Bmpii_liteflow/checkpoints/ckpt_iter_42000.pth.tar) and put it into `./models/cmp/experiments/semiauto_annot/resnet50_vip+mpii_liteflow/checkpoints`.

	2. Downloading the necessary pretrained checkpoints from [huggingface](https://huggingface.co/MyNiuuu/MOFA-Video-Hybrid). It is recommended to directly using git lfs to clone the [huggingface repo](https://huggingface.co/MyNiuuu/MOFA-Video-Hybrid). The checkpoints should be orgnized as `./ckpt_tree.md` (they will be automatically organized if you use git lfs to clone the [huggingface repo](https://huggingface.co/MyNiuuu/MOFA-Video-Hybrid)).


	## Run Gradio Demo

	### Using audio to animate the facial part

	`python run_gradio_audio_driven.py`

	### Using refernce video to animate the facial part

	`python run_gradio_video_driven.py`

	IMPORTANT: Please refer to the instructions on the gradio interface during the inference process.