Occlusion robust 3D face reconstruction model in CFR-GAN (WACV 2022)

Last update: Dec 19, 2022

Overview

Occlusion Robust 3D face Reconstruction

Yeong-Joon Ju, Gun-Hee Lee, Jung-Ho Hong, and Seong-Whan Lee

Code for Occlusion Robust 3D Face Reconstruction in "Complete Face Recovery GAN: Unsupervised Joint Face Rotation and De-Occlusion from a Single-View Image (WACV 2022)"

We propose our novel two stage fine-tuning strategy for occlusion-robust 3D face reconstruction. The training method is split into two training stages due to the difficulty of initial training for extreme occlusions. We fine-tune the baseline with our newly created datasets in the first stage and with teacher-student learning method in the second stage.

Our baseline is Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set and we also referred this code. Note that we focus on alignments and colors for guidance of CFR-GAN in occluded facial images.

Requirements

Python 3.7 or 3.8 can be used.
```
pip install -r requirements.txt
```
Install the Pytorch3D==0.2.5
Basel Face Model 2009 (BFM09) and Expression Basis (transferred from Facewarehouse by Guo et al.). The original BFM09 model does not handle expression variations so extra expression basis are needed.
- However, we made BFM_model_80.mat (Dimension of id coef and tex coef is 80). Download and move mmRegressor/BFM folder.

Usage

Preprocessing:

Prepare your own dataset for data augmentation. The datasets used in this paper can be downloaded in follows:

Download links: CelebA, 300W-LP, Multi-PIE (cropped version in CR-GAN)

Except when the dataset has facial landmarks labels, you should predict facial landmarks. We recommend using 3DDFA v2. If you want to reduce error propagation of the facial alignment networks, prepend a flag to filename. (ex) "pred"+[filename])

In order to train occlusion-robust 3D face model, occluded face image datasets are essential, but they are absent. So, we create datasets by synthesizing the hand-shape mask.

python create_train_stage1.py --img_path [your image folder] --lmk_path [your landmarks folder] --save_path [path to save]

For first training stage, prepare occluded (augmented images), ori_img (original images), landmarks (3D landmarks) folders or modify folder name in train_stage1.py.

**You must align images with align.py**

meta file format is:

[filename] left eye x left eye y right eye x right eye y nose x nose y left mouth x left mouth y ...

You can use MTCNN or RetinaFace

First Fine-tuning Stage:

Instead of skin mask, we use BiseNet, face parsing network. The codes and weights were modified and re-trained from this code.

Download weights of face parsing networks to faceParsing folder.
Download weights of baseline 3D networks to mmRegressor/network folder.

Train occlusion-robust 3D face model

python train_stage1.py

To show logs

tensorboard --logdir=logs_stage1 --bind_all --reload_multifile True

Second Fine-tuning Stage:

You can download MaskedFaceNet dataset in here.
You can download FFHQ dataset in here.

Train

python train_stage2.py

To show logs

tensorboard --logdir=logs_stage2 --bind_all --reload_multifile True

Evaluation

python evaluation/benchmark_nme_aflw_2000.py

If you would like to evaluate your results, please refer evaluation/estimate_aflw2000.py

Occlusion robust 3D face reconstruction model in CFR-GAN (WACV 2022)

Related tags

Overview

Occlusion Robust 3D face Reconstruction

Requirements

Usage

Preprocessing:

First Fine-tuning Stage:

Second Fine-tuning Stage:

Evaluation

Owner

Yeongjoon

Generating Radiology Reports via Memory-driven Transformer

Efficiently computes derivatives of numpy code.

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)

Yet another video caption

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

An offline deep reinforcement learning library

Implementation of GGB color space

Computational Pathology Toolbox developed by TIA Centre, University of Warwick.

Clustergram - Visualization and diagnostics for cluster analysis in Python

Kinetics-Data-Preprocessing

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

GAN JAX - A toy project to generate images from GANs with JAX

Hand tracking demo for DIY Smart Glasses with a remote computer doing the work

A Framework for Encrypted Machine Learning in TensorFlow

HMLLDB is a collection of LLDB commands to assist in the debugging of iOS apps.

AI that generate music

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

CrossNorm and SelfNorm for Generalization under Distribution Shifts (ICCV 2021)