arxiv:2312.04461

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Published on Dec 7, 2023
Submitted by akhaliq on Dec 8, 2023
#1 Paper of the day

Abstract

Recent advances in text-to-image generation have made remarkable progress in synthesizing realistic human photos conditioned on given text prompts. However, existing personalized generation methods cannot simultaneously satisfy the requirements of high efficiency, promising identity (ID) fidelity, and flexible text controllability. In this work, we introduce PhotoMaker, an efficient personalized text-to-image generation method, which mainly encodes an arbitrary number of input ID images into a stacked ID embedding for preserving ID information. Such an embedding, serving as a unified ID representation, can not only encapsulate the characteristics of the same input ID comprehensively, but also accommodate the characteristics of different IDs for subsequent integration. This paves the way for more intriguing and practically valuable applications. Besides, to drive the training of PhotoMaker, we propose an ID-oriented data construction pipeline to assemble the training data. Trained on the dataset constructed through this pipeline, PhotoMaker demonstrates better ID preservation than test-time fine-tuning based methods, while also providing significant speed improvements, high-quality generation results, strong generalization capabilities, and a wide range of applications. Our project page is available at https://photo-maker.github.io/
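
To make the idea of stacking per-image embeddings concrete, here is a toy sketch. It is not the PhotoMaker implementation: `toy_encode` is a hypothetical stand-in for a real image encoder (it only hashes a file name), the file names are invented, and the embedding dimension is arbitrary. It only illustrates that any number of inputs, from one identity or several, yields a single stacked representation with one row per image.

```python
import hashlib
from typing import List, Sequence

EMBED_DIM = 4  # toy dimension, chosen only for illustration


def toy_encode(image_name: str, dim: int = EMBED_DIM) -> List[float]:
    """Hypothetical stand-in for an image encoder: hashes a name to a fixed-length vector."""
    digest = hashlib.sha256(image_name.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]


def stack_id_embeddings(image_names: Sequence[str]) -> List[List[float]]:
    """Stack one embedding per input image into an (N, dim) list of rows."""
    if not image_names:
        raise ValueError("at least one ID image is required")
    return [toy_encode(name) for name in image_names]


# Several images of the same (made-up) identity.
same_id = stack_id_embeddings(["alice_1.jpg", "alice_2.jpg", "alice_3.jpg"])
# Images of different identities go through the same path.
mixed_ids = stack_id_embeddings(["alice_1.jpg", "bob_1.jpg"])

print(len(same_id), len(same_id[0]))      # rows x dim for the first stack
print(len(mixed_ids), len(mixed_ids[0]))  # rows x dim for the second stack
```

The stack grows with the number of input images while the per-image dimension stays fixed, which is what lets a downstream consumer accept a variable number of ID images.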

Community

Paper author

Here are more results:

website_recontext_lowres.jpg

Super

Paper author

website_oldphoto_lowres.jpg

Paper author

stylization.jpg

cool

IMG_20240308_172908.jpg

an img of a guy that did not get how it works?

PhotoMaker: Revolutionizing Custom Human Photos with AI Magic!

Links 🔗:

👉 Subscribe: https://www.youtube.com/@Arxflix
👉 Twitter: https://x.com/arxflix
👉 LMNT (Partner): https://lmnt.com/

By Arxflix

Models citing this paper 3

Datasets citing this paper 0

Spaces citing this paper 165

Collections including this paper 28