metadata

tags:
  - text-to-image
  - stable-diffusion
  - dress
  - try on
license: apache-2.0
language:
  - en
library_name: diffusers

IMAGDressing: Interactive Modular Apparel Generation for Dressing

IMAGDressing-v1: Customizable Virtual Dressing

Project Page | Paper | Code| Data

Introduction

To address the need for flexible and controllable customizations in virtual try-on systems, we propose IMAGDressing-v1. Specifically, we introduce a garment UNet that captures semantic features from CLIP and texture features from VAE. Our hybrid attention module includes a frozen self-attention and a trainable cross-attention, integrating these features into a frozen denoising UNet to ensure user-controlled editing. We will release a comprehensive dataset, IGv1, with over 200,000 pairs of clothing and dressed images, and establish a standard data assembly pipeline. Furthermore, IMAGDressing-v1 can be combined with extensions like ControlNet, IP-Adapter, T2I-Adapter, and AnimateDiff to enhance diversity and controllability.