PhotoMaker: Create a new, personalized image of a person

PhotoMaker generates photos of a person that match a text description.
It can also blend the photo features of several different people to create a completely new character.
It can change the person's gender or age and render additional photos in multiple styles.
Generation is fast, and the results look realistic and natural.

Main functions:

1. Create a photo from a text description: Describe what the person should look like, and PhotoMaker generates a photo of the person that matches the description.
2. Mix the characteristics of multiple people: It can combine the photo characteristics of several different people to create a brand new character image.
3. Change age or gender: For example, you can transform a photo of a man into a photo of a woman, or make the person in the photo look younger or older.
4. Add artistic style: It can also apply different artistic effects to photos, such as oil painting style or cartoon style.
5. Bring characters from artworks or old photos to life: It can render characters from artworks or old photos as if they were photographed in the modern era.

Here is an example to explain how PhotoMaker works:

Suppose you have three photos:

A photo of you.
A photo of your favorite movie star.
A photo of a person wearing historical clothing.

Now suppose you want to create a new photo of yourself, but with certain characteristics of the movie star (such as hairstyle or expression) and wearing the historical clothing. You can upload these three photos to PhotoMaker.

PhotoMaker works like this:

It first analyzes the three photos and identifies the characteristics of the person in each photo, such as facial features, hairstyle, and clothing style.

Then, it combines these features to create a new image. The image keeps your face, but may have the hairstyle and expression of the movie star and the costume from the historical clothing photo.

The end result is a completely new photo that shows a unique persona: one that looks like you, but with the characteristics and style of the people in the other photos. This is the core function of PhotoMaker: combining the characteristics of multiple different photos to create a new, personalized image.

Main features:

The core technology used by PhotoMaker is “stacked ID embedding,” which combines information from multiple ID images (photos that carry a person's identity) into a unified data structure. This approach not only captures the subtle features of a single ID, but can also integrate features from multiple different IDs to create new, personalized images.
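
To make the idea of stacking concrete, here is a minimal Python sketch. It is not PhotoMaker's implementation: toy_encode is a hypothetical stand-in for a real image encoder, and the photo file names are made up for illustration. The only point it shows is that each ID image keeps its own embedding row in the combined structure, rather than all images being collapsed into a single vector.

```python
from typing import List

Vector = List[float]


def toy_encode(photo_name: str, dim: int = 4) -> Vector:
    """Hypothetical stand-in for an image encoder.

    Derives a deterministic pseudo-feature vector from the characters of the
    file name. A real system would run a neural image encoder on the pixels.
    """
    codes = [ord(c) for c in photo_name]
    return [(sum(codes[i::dim]) % 100) / 100.0 for i in range(dim)]


def stack_id_embeddings(photos: List[str], dim: int = 4) -> List[Vector]:
    """Encode every ID image and stack the results, one row per image."""
    if not photos:
        raise ValueError("at least one ID image is required")
    return [toy_encode(name, dim) for name in photos]


# Hypothetical inputs matching the three-photo example in the text.
photos = ["me.jpg", "movie_star.jpg", "historical_costume.jpg"]
stacked = stack_id_embeddings(photos)

# The stacked structure has shape (number of images, embedding size).
print(len(stacked), len(stacked[0]))  # 3 4
```

Passing photos of the same person yields several views of one identity, while passing photos of different people places several identities side by side in the same structure, which is what makes mixing possible.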

1. Combine the features of multiple photos: PhotoMaker accepts multiple photos, which can be of the same person or of different people. It analyzes the features in these photos and then combines them to create a new, personalized image of a person.
2. Flexible text control: Compared with other methods, PhotoMaker offers more flexible text control. Users guide image generation through text prompts, so the resulting images are not only realistic but also better match the user's specific requirements.
3. Efficient and fast generation: Compared with methods that require extensive fine-tuning (such as DreamBooth), PhotoMaker generates images faster while still preserving identity information. It can produce customized photos of a person in a short time (e.g. 10 seconds), which benefits applications that need results quickly.
4. High identity fidelity: PhotoMaker better preserves the identity of the person when generating an image. This is especially important for use cases that require the person's characteristics to be preserved accurately, such as personalized portraits or identification.
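
As an illustration of the text control described in point 2, the sketch below assembles a prompt from a class word, a description, and an optional style. It rests on assumptions: build_prompt is a hypothetical helper, not part of PhotoMaker, and the trigger token "img" placed after the class word follows our reading of the examples in the project's GitHub repository, so check the repository before relying on it.

```python
TRIGGER_WORD = "img"  # assumption: trigger token taken from the repository's examples


def build_prompt(class_word: str, description: str, style: str = "") -> str:
    """Hypothetical helper: put the trigger word right after the class word.

    class_word  -- a single word naming the subject, e.g. "man" or "woman"
    description -- free text describing clothing, setting, age, and so on
    style       -- optional artistic style, e.g. "oil painting"
    """
    class_word = class_word.strip()
    if not class_word or " " in class_word:
        raise ValueError("class_word must be a single non-empty word")
    parts = [f"a {class_word} {TRIGGER_WORD}", description.strip()]
    if style.strip():
        parts.append(f"{style.strip()} style")
    # Skip empty pieces so the prompt never contains dangling commas.
    return ", ".join(p for p in parts if p)


print(build_prompt("woman", "wearing a Victorian dress", "oil painting"))
# a woman img, wearing a Victorian dress, oil painting style
```

Changing only the description or the style while keeping the same ID images is how a user would ask for a different age, outfit, or artistic look for the same person.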

Project page and demos: https://photo-maker.github.io
Paper: https://huggingface.co/papers/2312.04461
GitHub: https://github.com/TencentARC/PhotoMaker

Online demos:

https://huggingface.co/spaces/TencentARC/PhotoMaker
https://huggingface.co/spaces/TencentARC/PhotoMaker-Style
