GitHub - UdiGal/046211_project_repo: StyleGAN2-ADA - Official PyTorch implementation

Face Aging using Conditional GAN

Performing Face Aging (directed image generation) using StyleGAN2-ADA and InceptionResnetV1.

This repository is built upon the repositories of StyleGAN2-ADA Pytorch, Facenet PyTorch and Processed IMDB WIKI Dataset.

Final project to course 046211 - Deep Learning in the Technion
by Udi Gal & Moshe Rafaeli

Path	Description
`046211_project_repo`	Main directory
├ `directed_image_generation.ipynb`	Directed generation of image based on input image and class
├ `make_json_labels.py`	Create multiclass json labels file for stylegan2-ada-pytorch
├ `train_inception_resnet_v1.ipynb`	Setup and training for the InceptionResnetV1 encoder
└ `train_sg2_ada.ipynb`	Setup and training for the StyleGAN2-ADA generator

Directed image generation

To perform directed image generation, refer to the directed_image_generation.ipynb file and follow the instructions. In order to run it you will need:

A trained generator pickle file (generator_pkl)
A trained encoder pytorch file (encoder_pt)
An input image (input_path)

Training the Generator

To train the generator over different dataset, refer to the train_sg2_ada.ipynb file and follow the instructions. In order to run it you will need:

A dataset to train on (dataset_path)
A starting point trained generator (optional - resume_from)
The training code is located at train.py with all the relevant documentation

Training the Encoder

To train the encoder again, refer to the train_inception_resnet_v1.ipynb file and follow the instructions. In order to run it you will need:

A trained generator pickle file (generator_pkl)
You can optimize the training hyper-parameters for your needs.

How does it work

The goal of the project is to enable converting between the image space and latent space easily, and with these conversions perform the directed generation.

Generation: Converting between Latent Space to Image Space

We use a conditional StyleGAN2-ADA model as our generator.
We trained the generator over the IMDB-WIKI dataset (after pre-processing) labeled with age groups of [0-10], [11-19], [20-29], [30-39], [40-49], [50-59], [60-69], [70-79], [80-89], [90+].
After we finished training we have a generator that can generate faces according to latent variables and a class.

Encoding: Converting between Image Space to Latent Space

We use an InceptionResnetV1 model as our encoder. The goal of the encoder is to learn the distribution of the latent space that was learned by the generator, and to reproduce the latent variables from an image.
We trained the encoder over 10,000 randomly generated images from the generator and their corresponding latent variables, using MSE to calculate the loss between the encoding and the actual latent variables.
It is important to see that the encoder does not consider the class of the image as and input, and that is to ensure that the encoder will be able to identify two images with the same latent variables but different class.

Directed Generation: Combining the Encoder and the Generator

Now we have a generator and en encoder, so we can use them to perform the directed generation.

For a given image, the encoder calculates the latent variables of input image.
Then, the latent variables and the given class go into the generator to create the appropriate image that will transfer the class of the input image to the new class, and hopefully preserve the characteristics of the input image.

References:

StyleGAN2-ADA Pytorch - Nvidia Labs, October 2021
Facenet PyTorch - Tim Esler, December 2021
Processed IMDB WIKI Dataset - Abhishek Chatterjee, March 2019

License

This work is made available under the Nvidia Source Code License.

Name		Name	Last commit message	Last commit date
Latest commit History 140 Commits
.github		.github
dnnlib		dnnlib
docs		docs
facenet_encoder		facenet_encoder
metrics		metrics
output		output
processed-imdb-wiki-dataset		processed-imdb-wiki-dataset
torch_utils		torch_utils
training		training
util		util
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
dataset_tool.py		dataset_tool.py
directed_image_generation.ipynb		directed_image_generation.ipynb
generate.py		generate.py
legacy.py		legacy.py
make_json_labels.py		make_json_labels.py
projector.py		projector.py
train.py		train.py
train_inception_resnet_v1.ipynb		train_inception_resnet_v1.ipynb
traing_sg2_ada.ipynb		traing_sg2_ada.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Face Aging using Conditional GAN

Contents

Directed image generation

Training the Generator

Training the Encoder

How does it work

References:

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Face Aging using Conditional GAN

Contents

Directed image generation

Training the Generator

Training the Encoder

How does it work

References:

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages