My implementation of Image Inpainting - A deep learning Inpainting model

Last update: Dec 12, 2021

Image Inpainting

What is Image Inpainting

Image inpainting is a restorative process that allows for the fixing or removal of unwanted parts within images. Typically, this process is done by professionals who use software to change the image to remove the imperfection painstakingly. A deep learning approach bypasses manual labor typically used in this process and applies a neural network to determine the proper fill for the parts of the image.

Examples

From left to right: original, interpolated, predicted

Research and Development

The model is a fully convolutional deep residual network. I expected this type of model to work because it had worked in my previous image restoration projects. I also looked into other architectures, such as U-Net, but ran into trouble implementing them.

First, U-Net requires splitting the image into fixed-size patches during inference, so each patch has to be larger than the white space the user is trying to inpaint. For example, if inference runs on 64x64 patches and a region of white space completely covers one of them, the model has no surrounding pixels to reference and produces incorrect pixels for that patch. Handling this would require a different architecture that detects the size of the white space so that a suitable patch size can be selected.
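The failure case above can be detected before inference. The following is a minimal sketch, not code from this repository: it assumes the mask is a 2D grid of 0/1 values (1 marks a pixel to inpaint) and that patches are non-overlapping squares. The function name `fully_masked_patches` is hypothetical.

```python
def fully_masked_patches(mask, patch_size=64):
    """Return (top, left) origins of patches that contain no known pixels.

    mask: list of rows, each a list of 0/1 values; 1 = pixel to inpaint.
    """
    height, width = len(mask), len(mask[0])
    blind = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            rows = mask[top:top + patch_size]
            # A patch is "blind" when every pixel inside it is masked.
            if all(all(v == 1 for v in row[left:left + patch_size]) for row in rows):
                blind.append((top, left))
    return blind


# Example: a 128x128 mask whose bottom-left 64x64 quadrant is all white space.
mask = [[0] * 128 for _ in range(128)]
for y in range(64, 128):
    for x in range(0, 64):
        mask[y][x] = 1

print(fully_masked_patches(mask))  # [(64, 0)]
```

Any patch this reports would reach the model with no reference pixels at all, which is the situation described above.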

The next architecture I tried was a GAN (Generative Adversarial Network) based model. I had previously experimented with GANs and implemented a model that generates faces from the CelebA dataset; however, using GANs for inpainting proved to be a much more complex problem. I had trouble finding the right ratio between the two loss terms, the L1 loss and the adversarial loss from the discriminator. Although a GAN-based model would likely improve the output considerably, I could not tune the hyper-parameters well enough to balance both the loss functions and the training of the generator and discriminator.
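To make the balancing problem concrete, here is a minimal sketch of how a combined generator objective is commonly written. It assumes a binary cross-entropy adversarial term; the function names and the default `adv_weight` are illustrative and are not taken from this repository.

```python
import math


def l1_loss(predicted, target):
    """Mean absolute error between two equal-length pixel sequences."""
    return sum(abs(p - t) for p, t in zip(predicted, target)) / len(target)


def adversarial_loss(disc_scores, eps=1e-7):
    """Generator-side binary cross-entropy: push D(G(x)) toward 1.

    disc_scores: discriminator outputs in (0, 1] for generated images.
    """
    return -sum(math.log(max(s, eps)) for s in disc_scores) / len(disc_scores)


def generator_loss(predicted, target, disc_scores, adv_weight=0.001):
    """Weighted sum of the reconstruction term and the adversarial term."""
    return l1_loss(predicted, target) + adv_weight * adversarial_loss(disc_scores)
```

The single `adv_weight` value (hypothetical here) is the ratio that has to be tuned: it decides how much the discriminator's feedback counts relative to the per-pixel L1 error.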

I settled on the architecture described here because of its simplicity and reasonably good results.

Model Architecture

Method	Depth	Filters	Parameters	Training Time
Inpaint Model	50	192 (49 layers), 3 (final layer)	15,945k	~30 hrs
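The parameter count in the table can be reproduced by hand. The sketch below assumes 3x3 kernels with biases, a 3-channel RGB input, and no other trainable layers; none of these are stated here, so treat them as assumptions.

```python
def conv_params(in_channels, out_channels, kernel=3):
    """Weights plus biases for one 2D convolution layer."""
    return kernel * kernel * in_channels * out_channels + out_channels


def network_params(depth=50, filters=192, image_channels=3):
    """Total for `depth` conv layers: one first, (depth - 2) middle, one last."""
    first = conv_params(image_channels, filters)           # 3 -> 192
    middle = (depth - 2) * conv_params(filters, filters)   # 192 -> 192, 48 times
    last = conv_params(filters, image_channels)            # 192 -> 3
    return first + middle + last


print(network_params())  # 15945027
```

Under these assumptions the total is 15,945,027, which agrees with the 15,945k listed in the table.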

Network Architecture:

How do you use this model?

Because of the sheer size of this model, I can't upload it to GitHub. Instead, I have uploaded it to Google Drive, where you should be able to download it. Place the downloaded '.h5' file inside the 'weights/' directory.
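A quick way to confirm the download landed in the right place is sketched below. It uses only the standard library; the helper name `find_weights` is hypothetical, and the actual weight file name is whatever the download provides.

```python
from pathlib import Path


def find_weights(directory="weights"):
    """Return the sorted .h5 files found directly inside `directory`."""
    return sorted(Path(directory).glob("*.h5"))


if __name__ == "__main__":
    found = find_weights()
    if found:
        print("Found weights:", ", ".join(p.name for p in found))
    else:
        print("No .h5 file in weights/ yet.")
```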

How can you train your own model?

The model is instantiated in network.py, where you can adjust its hyper-parameters. To train the model, first delete the images currently in data/ and put your own training images in that directory; any large dataset such as ImageNet or an equivalent should work. Then adjust the training hyper-parameters in train.py and run train.py. If you're training on weaker hardware, I'd recommend lowering batch_size below the current setting of 4 images.
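One common way to feed a fully convolutional model is to train on fixed-size random crops. The sketch below assumes 96x96 crops and a batch of 4, the values listed under the training statistics; whether train.py samples crops this way is an assumption, and both function names are hypothetical.

```python
import random


def random_crop_box(width, height, size=96, rng=random):
    """Pick the (left, top, right, bottom) box of a random size x size crop."""
    if width < size or height < size:
        raise ValueError("image is smaller than the crop size")
    left = rng.randint(0, width - size)
    top = rng.randint(0, height - size)
    return left, top, left + size, top + size


def batch_boxes(image_sizes, batch_size=4, size=96, rng=random):
    """Choose `batch_size` images at random and one crop box for each.

    image_sizes: list of (width, height) tuples, one per training image.
    Returns a list of (image_index, box) pairs.
    """
    picks = [rng.randrange(len(image_sizes)) for _ in range(batch_size)]
    return [(i, random_crop_box(*image_sizes[i], size=size, rng=rng)) for i in picks]
```

Lowering `batch_size` here mirrors the advice above for weaker hardware: fewer crops are held in memory per training step.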

Qualitative Examples (click on the images for higher quality):

Set5 Evaluation Set:

Hardware - Training Statistics

Trained on an NVIDIA GeForce RTX 3070 Ti

Batch Size: 4

Training Image Size: 96x96

Author

Joshua Evans - github/JoshVEvans
