Combining segmentation and Inpainting

A project in image segmentation and inpainting. It extends LaMa (see the LaMa paper and the LaMa GitHub repository).

In this project we aim to augment the LaMa project. In the original paper, the input consists of a high-resolution image paired with a binary mask. We propose to generate the input masks automatically with a segmentation neural network, making the task fully automated.

In our project we use the DeepLabV3 model to segment the images. The pre-trained model was trained on a subset of COCO train2017, on the 20 categories that are present in the Pascal VOC dataset.
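As a rough illustration of how a segmentation result can become an inpainting mask, the sketch below turns a per-pixel label map into a binary mask for one class. The class list follows the conventional Pascal VOC ordering (background plus 20 categories). Whether this repository's CHOSEN_CLASS integers use the same ordering is an assumption, as is the mask convention (255 marks pixels to inpaint). The helper names are hypothetical and not part of the repository.

```python
# Conventional Pascal VOC label order: index 0 is background, followed by
# the 20 object categories. Assumed here; the repository may number classes
# differently.
VOC_CLASSES = [
    "background", "aeroplane", "bicycle", "bird", "boat", "bottle", "bus",
    "car", "cat", "chair", "cow", "diningtable", "dog", "horse", "motorbike",
    "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor",
]


def class_index(name):
    """Return the integer label for a class name (hypothetical helper)."""
    return VOC_CLASSES.index(name)


def binary_mask(label_map, chosen_class):
    """Mark pixels of chosen_class with 255 and everything else with 0.

    label_map is a 2D list of per-pixel class indices, for example the
    argmax over the class dimension of a segmentation network's output.
    """
    return [[255 if label == chosen_class else 0 for label in row]
            for row in label_map]


if __name__ == "__main__":
    labels = [[0, 12, 12], [0, 15, 12]]
    print(binary_mask(labels, class_index("dog")))
```

In a real pipeline the label map would be an image-sized array rather than nested lists; the per-pixel comparison stays the same.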

Written by George Pisha and Men Yevgeniy.

Getting Started

Clone the Repo:

git clone https://github.com/yevgm/Combining-segmentation-and-Inpainting
cd Combining-segmentation-and-Inpainting

Test Dataset

The following links will download the data folders:

Test Dataset - Contains the test dataset for three classes (dog, bus, person) with their manual segmentation masks, automatic segmentation masks, and the outputs
LaMa fourier model - Pretrained LaMa model (same as the original)

Prerequisites

Setup conda

conda env create -f env.yml
conda activate seg_inpaint
pip install pyyaml==5.4.1

This will create a working environment named 'seg_inpaint'

Test images and pretrained models can be downloaded via the links above

Running the Code

To run the inpainting pipeline, use command (2) with the following arguments:
- -c selects the class to remove, as an integer. Command (1) prints the available values for CHOSEN_CLASS
- -i is the full path to the input images (./test_images in the example)
- --lama-model-path is the path to the pretrained lama-fourier model
- --lama-model-name is the filename of the model

If you wish to run the video temporal inconsistency pipeline:
- You must use the config './src/video_seg/video_config.yaml'
- The model will be trained on the video frames, which must be present in the 'output' directory in the repository root
- Just run command (3)

1. Print available classes to remove from an image

python ./main.py -a print_cls

2. Run the inpainting pipeline

python ./main.py -a inpaint -c CHOSEN_CLASS -i $(pwd)/test_images --lama-model-path $(pwd)/lama-fourier --lama-model-name best.ckpt

3. Run the video temporal inconsistency pipeline

export PYTHONPATH=.
python ./src/video_seg/train.py
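When running command (2) over several classes or folders, it can help to assemble the argument list in Python. The sketch below only uses the flags shown in command (2). The function name is hypothetical, and the class integer in the usage example is a placeholder, not a verified value.

```python
import shlex
import sys
from pathlib import Path


def build_inpaint_command(chosen_class, images_dir, model_dir,
                          model_name="best.ckpt"):
    """Assemble the argument list for command (2) without running it."""
    return [
        sys.executable, "./main.py",
        "-a", "inpaint",
        "-c", str(chosen_class),
        # The README asks for full paths, so resolve them to absolute ones.
        "-i", str(Path(images_dir).resolve()),
        "--lama-model-path", str(Path(model_dir).resolve()),
        "--lama-model-name", model_name,
    ]


if __name__ == "__main__":
    # 12 is a placeholder class integer; use command (1) to list real values.
    cmd = build_inpaint_command(12, "test_images", "lama-fourier")
    print(shlex.join(cmd))
    # To run it from the repository root:
    # import subprocess; subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when paths contain spaces.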

After running the inpainting command (2), two directories will be created:

input - which will include the original images alongside their semantic segmentation mask
output - which will include the inpainted images

After running the temporal inconsistency pipeline (3):

results - will contain the output video
logs - will include training logs
a model checkpoint will be saved

Numerical Evaluation

To calculate the numerical results on the whole dataset:

- Download the test images from the link above
- Run the following command

export PYTHONPATH=.
python src/segmentation_comparison.py -t ../../test_data_comparison

This calculates the LPIPS distance between the original image and the inpainted image for every class, for both automatic (semantic segmentation) and manual mask generation.
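The per-class averaging described above could be organized roughly as in the following sketch. It is not the code in src/segmentation_comparison.py: the function names, the record layout, and the injected `distance` callable are assumptions made for illustration. In practice `distance` would wrap an LPIPS implementation; a simple stand-in metric is used here so the example runs without extra dependencies.

```python
from collections import defaultdict
from statistics import mean


def mean_distance_per_class(samples, distance):
    """Average distance(original, inpainted) per (class, mask mode).

    samples: iterable of (class_name, mode, original, inpainted), where
    mode is "manual" or "auto". distance: callable returning a float.
    """
    grouped = defaultdict(list)
    for class_name, mode, original, inpainted in samples:
        grouped[(class_name, mode)].append(distance(original, inpainted))
    return {key: mean(values) for key, values in grouped.items()}


def mean_abs_diff(a, b):
    """Stand-in metric (NOT LPIPS): mean absolute difference of flat pixel lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    # Toy pixel values, purely illustrative; these are not project results.
    toy = [
        ("dog", "manual", [0.0, 1.0], [0.0, 0.5]),
        ("dog", "manual", [1.0, 1.0], [1.0, 1.0]),
        ("dog", "auto", [0.0, 1.0], [0.5, 0.5]),
    ]
    print(mean_distance_per_class(toy, mean_abs_diff))
```

Keeping the metric as a parameter lets the same grouping logic be reused with a different perceptual distance.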

Numerical Results

Class	Manual (LPIPS)	Segmentation (LPIPS)
dog	0.1261	0.1314
bus	0.1013	0.1018
person	0.1626	0.1584
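To make the gap between the two mask sources easier to read, the short sketch below computes the absolute and relative difference per class from the values in the table above. The function name is hypothetical; the numbers are copied from the table, not recomputed from the data.

```python
# LPIPS values copied from the table above: (manual, segmentation).
RESULTS = {
    "dog": (0.1261, 0.1314),
    "bus": (0.1013, 0.1018),
    "person": (0.1626, 0.1584),
}


def compare(manual, auto):
    """Return (absolute, relative-percent) change of auto versus manual."""
    diff = auto - manual
    return diff, 100.0 * diff / manual


if __name__ == "__main__":
    for name, (manual, auto) in RESULTS.items():
        diff, pct = compare(manual, auto)
        print(f"{name:<7} diff={diff:+.4f} ({pct:+.2f}%)")
```

For all three classes the automatic masks stay within about 5% of the manual ones, and for the person class the automatic value is slightly lower.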

Extending to Video

We also extend LaMa to video by building a pipeline that feeds it video frames. To improve temporal consistency, we add an optional training step that uses internal learning and the Deep Image Prior concept. First run the inpainting pipeline:

python ./main.py -a inpaint -c CHOSEN_CLASS -i $(pwd)/test_images --lama-model-path $(pwd)/lama-fourier --lama-model-name best.ckpt

Then run

python ./src/video_seg/train.py -i ./video_imgs
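One simple way to picture temporal consistency is to measure how much consecutive frames change. The sketch below is an illustrative metric only, not the training objective used in src/video_seg/train.py; the function name and the frame representation (flat lists of pixel intensities) are assumptions.

```python
def temporal_inconsistency(frames):
    """Mean absolute pixel change between consecutive frames.

    frames: list of equally sized flat lists of pixel intensities.
    Returns 0.0 when there are fewer than two frames.
    """
    if len(frames) < 2:
        return 0.0
    total = 0.0
    count = 0
    for prev, curr in zip(frames, frames[1:]):
        if len(prev) != len(curr):
            raise ValueError("all frames must have the same size")
        total += sum(abs(a - b) for a, b in zip(prev, curr))
        count += len(curr)
    return total / count if count else 0.0


if __name__ == "__main__":
    steady = [[10, 10], [10, 10], [10, 10]]
    flicker = [[10, 10], [30, 10], [10, 10]]
    print(temporal_inconsistency(steady), temporal_inconsistency(flicker))
```

Real motion also changes pixels between frames, so a measure like this is most informative when restricted to the inpainted region and compared before and after the optional training step.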

Video Results - Original

Video Results - LaMa

Video Results - Post Processed

License

This project is licensed under the MIT License. See the LICENSE file for details.
