GenerativeFaceCompletion

Matcaffe implementation of our CVPR17 paper on face completion.

In each panel from left to right: original face, masked input, completion result.

Setup

We use the caffe-for-cudnn-v2.5.48. Please refer Caffe for more installation details.
Basically, you need to first modify the MATLAB_DIR in Makefile.config and then run the following commands for a successful compilation:

make all -j4
make matcaffe

Training

Follow the DCGAN to prepare the data (CelebA). The only differece is that the face we cropped is of size 128x128. Please modify Line 10 in their crop_celebA.lua file. We use the standard train&test split of the CelebA dataset.
Modify the training data path in ./matlab/FaceCompletion_training/GFC_caffeinit.m file.
Download our face parsing model Model_parsing and put it under ./matlab/FaceCompletion_training/model/ folder.
We provide an initial model that is only trained with the reconstruction loss, as a good start point for the subsequent GAN training. Please download it and put it under ./matlab/FaceCompletion_training/model/ folder.
Run ./matlab/FaceCompletion_training/demo_GFC_training.m for training.

Testing

Download our face completion model Model_G and put it under ./matlab/FaceCompletion_testing/model/ folder.
Run ./matlab/FaceCompletion_testing/demo_face128.m for completion. TestImages are from the CelebA test dataset.

Citation

@inproceedings{GFC-CVPR-2017,
    author = {Li, Yijun and Liu, Sifei and Yang, Jimei and Yang, Ming-Hsuan},
    title = {Generative Face Completion},
    booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
    year = {2017}
}

Acknowledgement

Gratitude goes to Sifei Liu for the great help on code.
The upsample layer (unpooling according to the pooling mask) is borrowed from the SegNet.

The source code of CVPR17 'Generative Face Completion'.

Related tags

Overview

GenerativeFaceCompletion

Setup

Training

Testing

Citation

Acknowledgement

Owner

Yijun Li

Create Own QR code with Python

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)

ParmeSan: Sanitizer-guided Greybox Fuzzing

Using OpenAI's CLIP to upscale and enhance images

Transformer Huffman coding - Complete Huffman coding through transformer

A study project using the AA-RMVSNet to reconstruct buildings from multiple images

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

Understanding Hyperdimensional Computing for Parallel Single-Pass Learning

PyTorch code of paper "LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering"

Think Big, Teach Small: Do Language Models Distil Occam’s Razor?

GestureSSD CBAM - A gesture recognition web system based on SSD and CBAM, using pytorch, flask and node.js

Bling's Object detection tool

Llvlir - Low Level Variable Length Intermediate Representation

shufflev2-yolov5：lighter, faster and easier to deploy

Learning Spatio-Temporal Transformer for Visual Tracking

Mae segmentation - Reproduction of semantic segmentation using masked autoencoder (mae)

🇰🇷 Text to Image in Korean

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

A simple, fully convolutional model for real-time instance segmentation.