My first day at CMU, according to Stable Diffusion

This repository contains the code I used to generate images of myself on my hypothetical first day at CMU, using Stable Diffusion.

For privacy reasons, I am not uploading the fine-tuned models nor the pictures of myself that I used to train them, as anyone could generate images of me using them. However, I am uploading the code I used to generate the images, as well as the images of CMU's School of Computer Science that I used to let the model know how it looks like (see cmu_cs_images).

Environment setup

To start with, follow Hugging Face's tutorial on fine-tuning DreamBooth to set up your local environment. Then, to avoid overfitting, you should train the model with a prior-preserving loss. For this purpose, I used the person_ddim dataset, which you can download as follows:

git clone https://github.com/djbielejeski/Stable-Diffusion-Regularization-Images-person_ddim.git

mkdir -p regularization_images/person_ddim
mv -v Stable-Diffusion-Regularization-Images-person_ddim/person_ddim/*.* regularization_images/person_ddim

Fine-tuning

I fine-tuned the model in two stages. First, I trained it with ~30 pictures of myself and with prior-preserving loss, using train_person.py. Once the model was able to generate good-quality images of me, I fine-tuned this model with the 5 pictures of CMU's School of Computer Science that I uploaded to this repository, using train_building.py.

Generating images

Once the model is fine-tuned, inference.ipynb can be used to generate images of me on my first day at CMU. The images generated by the model are saved in the potential_images directory.

Fun fact

A variety of prompts were tried until reaching the desired results. At the beginning of this process, my generated face was very noisy for photo-realistic images. However, the quality of my generated face was very good when generating fictional images or in a cartoon-like style. Surprisingly, the quality of the face radically improved when I asked the model to generate images of a person looking similar to me. I guess there are some internal checks inside the model to avoid generating realistic images of people, in a effort to avoid DeepFakes. But it seems that one can get over these checks quite easily.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
cmu_cs_images		cmu_cs_images
potential_images		potential_images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.ipynb		inference.ipynb
train_building.py		train_building.py
train_person.py		train_person.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

My first day at CMU, according to Stable Diffusion

Environment setup

Fine-tuning

Generating images

Fun fact

About

Releases

Packages

Languages

License

atmguille/guille-at-cmu

Folders and files

Latest commit

History

Repository files navigation

My first day at CMU, according to Stable Diffusion

Environment setup

Fine-tuning

Generating images

Fun fact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages