This is a simple face recognition mini project, completed by a team of 3 members in 1 week.

Last update: Jan 25, 2022

PeekingDuckling

1. Description

This is an implementation of a facial identification algorithm that detects faces and identifies the 3 team members (Clarence, Eric Lee and Eric Kwok), labeling every other detected face as 'Others'.

We will be using the PeekingDuck framework for this mini project.

1.1 Example

2. Usage

2.1 Running the PeekingDuck nodes directly

python -m src.runner

usage: runner.py [-h] [--type {live_video,recorded_video,live_video_and_save}] [--input_filepath INPUT_FILEPATH] [--input_source INPUT_SOURCE] [--save_video_path SAVE_VIDEO_PATH] [--fps FPS]

Facial Recoginition algorithm

optional arguments:
  -h, --help            show this help message and exit
  --type {live_video,recorded_video,live_video_and_save}
                        Whether to use live webcam video or from a recorded video, or from a live webcam video and saving the recorded frames as a video file.
  --input_filepath INPUT_FILEPATH
                        The path to your video files if --type is 'recorded_video'
  --input_source INPUT_SOURCE
                        Input source integer value. Refer to cv2 VideoCapture class. Applicable for --type ['live_video' | 'live_video_and_save']
  --save_video_path SAVE_VIDEO_PATH
                        Path for video to be saved. Applicable for --type 'live_video_and_save'
  --fps FPS             Frames per second for video to be saved. Applicable for --type 'live_video_and_save'
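
The options above depend on each other: a file path only matters for 'recorded_video', and a save path only for 'live_video_and_save'. A minimal sketch of a parser with the same flags is given here to make that relationship explicit. It is not the code in src/runner.py; the defaults and the validate helper are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser with the flags from the help text; all defaults are assumed."""
    parser = argparse.ArgumentParser(prog="runner.py")
    parser.add_argument(
        "--type",
        choices=["live_video", "recorded_video", "live_video_and_save"],
        default="live_video",  # assumed default
    )
    parser.add_argument("--input_filepath")
    parser.add_argument("--input_source", type=int, default=0)  # cv2 camera index
    parser.add_argument("--save_video_path")
    parser.add_argument("--fps", type=int, default=10)  # assumed default
    return parser


def validate(args: argparse.Namespace) -> None:
    """Hypothetical check that the chosen --type has the options it needs."""
    if args.type == "recorded_video" and not args.input_filepath:
        raise ValueError("--input_filepath is required for 'recorded_video'")
    if args.type == "live_video_and_save" and not args.save_video_path:
        raise ValueError("--save_video_path is required for 'live_video_and_save'")


if __name__ == "__main__":
    parsed = build_parser().parse_args()
    validate(parsed)
    print(parsed)
```

For example, passing --type recorded_video together with --input_filepath satisfies the check, while --type live_video_and_save without --save_video_path is rejected.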

2.2 Using the PeekingDuck from the web interface

python -m src.camera

2.3 Face recognition using only 1 photo

python -m src.app

In a separate terminal, run the following command:

python -m src.python_client <path_to_your_image>

3. Model

3.1 Face Detection

In this repository, we use the PeekingDuck library to perform face detection.

Face detection uses the pretrained MTCNN model provided by the PeekingDuck framework.

3.2 Face Identification

For face identification, the cropped images (224 x 224) obtained from the face detection stage are passed to a pretrained ResNet50 model (trained on the VGGFace2 dataset) with a global average pooling layer to obtain the face embedding. Each face embedding is then compared against a database of face embeddings collected from the members to verify whether the detected face belongs to one of the 3 members.
The comparison uses a 1-NN model with cosine similarity: if the similarity to the nearest neighbor falls below a set threshold, the image is classified as 'others'.
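
The nearest-neighbor comparison can be sketched in a few lines of NumPy. This is an illustration of 1-NN matching with a cosine-similarity threshold, not the repository's code; the function names and the toy 2-D embeddings are made up for the example.

```python
import numpy as np


def l2_normalize(x: np.ndarray) -> np.ndarray:
    """Scale vectors to unit length so a dot product equals cosine similarity."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)


def identify(query, db_embeddings, db_labels, threshold):
    """Return the label of the most similar database entry, or 'others'."""
    sims = l2_normalize(db_embeddings) @ l2_normalize(query)
    best = int(np.argmax(sims))  # 1-NN: keep only the closest entry
    if sims[best] < threshold:
        return "others"
    return db_labels[best]


# Toy database: one 2-D "embedding" per member (real embeddings are much larger).
db = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])
labels = ["Clarence", "Eric Kwok", "Eric Lee"]

print(identify(np.array([0.9, 0.1]), db, labels, threshold=0.342))
print(identify(np.array([0.0, -1.0]), db, labels, threshold=0.342))
```

The first query points almost exactly along the first database vector and is matched to it; the second has a similarity of at most 0 to every entry, so it falls below the threshold and is labeled 'others'. The value 0.342 is taken from the Performance section, on the assumption that it is expressed as a similarity.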

The face embeddings were built using 651 images from Clarence, 644 images from Eric Kwok and 939 images from Eric Lee.

A low dimensional representation of the face embedding database of the 3 members using the first 2 principal components from the PCA of the face embeddings can be found in the image below.
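
For readers who want to reproduce this kind of plot, a 2-component PCA projection can be computed with an SVD of the mean-centered embeddings. The sketch below uses synthetic stand-in vectors (the cluster sizes and the 128-D width are arbitrary), since the actual embedding database is not part of this document.

```python
import numpy as np


def pca_2d(embeddings: np.ndarray) -> np.ndarray:
    """Project the rows of `embeddings` onto their first two principal components."""
    centered = embeddings - embeddings.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T


# Synthetic stand-in: three clusters of 128-D vectors, one per member.
rng = np.random.default_rng(0)
centers = rng.normal(size=(3, 128)) * 5.0
fake_db = np.vstack([c + rng.normal(size=(50, 128)) for c in centers])

points = pca_2d(fake_db)
print(points.shape)  # one (x, y) pair per embedding, ready for a scatter plot
```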

Each image was augmented with 4 extra copies, generated using random rotations of up to (+/-) 20 degrees and random contrast changes, to make the database more robust. The PCA of the augmented database can be seen in the image below.
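
A minimal NumPy-only sketch of this augmentation follows. The rotation range and the number of copies come from the text; the contrast range, the nearest-neighbor rotation and the function names are assumptions, and the project may well have used an image library instead.

```python
import numpy as np


def rotate(image: np.ndarray, degrees: float) -> np.ndarray:
    """Rotate an H x W x C image about its center (nearest neighbor, zero fill)."""
    h, w = image.shape[:2]
    theta = np.deg2rad(degrees)
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w]
    # Inverse mapping: for every output pixel, look up its source pixel.
    src_x = np.rint(np.cos(theta) * (xs - cx) + np.sin(theta) * (ys - cy) + cx).astype(int)
    src_y = np.rint(-np.sin(theta) * (xs - cx) + np.cos(theta) * (ys - cy) + cy).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros_like(image)
    out[valid] = image[src_y[valid], src_x[valid]]
    return out


def adjust_contrast(image: np.ndarray, factor: float) -> np.ndarray:
    """Stretch (factor > 1) or compress (factor < 1) pixel values around the mean."""
    mean = image.mean()
    out = (image.astype(np.float64) - mean) * factor + mean
    return np.rint(np.clip(out, 0, 255)).astype(np.uint8)


def augment(image, rng, n_extra=4, max_degrees=20.0, contrast_range=(0.7, 1.3)):
    """Return n_extra randomly rotated and contrast-adjusted copies of `image`."""
    copies = []
    for _ in range(n_extra):
        degrees = rng.uniform(-max_degrees, max_degrees)
        factor = rng.uniform(*contrast_range)  # assumed range
        copies.append(adjust_contrast(rotate(image, degrees), factor))
    return copies


rng = np.random.default_rng(0)
face = rng.integers(0, 256, size=(224, 224, 3), dtype=np.uint8)  # stand-in crop
print(len(augment(face, rng)))
```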

4. Performance

The face classification algorithm achieved an overall accuracy of 99.4% and a weighted F1 score of 99.4% on a test set of 183 images of Clarence, 179 of Eric Kwok, 130 of Eric Lee and 13,100 images of non-members obtained from this database.
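
For reference, both metrics can be derived from a confusion matrix as sketched below. The 2 x 2 matrix is a made-up example, not the project's test result, and the function name is hypothetical. Rows are assumed to hold the true classes and columns the predicted classes.

```python
import numpy as np


def accuracy_and_weighted_f1(confusion):
    """Compute overall accuracy and support-weighted F1 (rows = true classes)."""
    cm = np.asarray(confusion, dtype=float)
    tp = np.diag(cm).copy()
    support = cm.sum(axis=1)    # number of true samples per class
    predicted = cm.sum(axis=0)  # number of predictions per class
    precision = np.divide(tp, predicted, out=np.zeros_like(tp), where=predicted > 0)
    recall = np.divide(tp, support, out=np.zeros_like(tp), where=support > 0)
    denom = precision + recall
    f1 = np.divide(2 * precision * recall, denom, out=np.zeros_like(tp), where=denom > 0)
    accuracy = tp.sum() / cm.sum()
    weighted_f1 = (f1 * support).sum() / support.sum()
    return float(accuracy), float(weighted_f1)


# Made-up matrix for two classes with 10 true samples each.
toy = [[8, 2],
       [1, 9]]
print(accuracy_and_weighted_f1(toy))
```

Because the weighted F1 score weights each class by its number of test images, the 13,100 non-member images (out of 13,592 in total) contribute most of the weight in the reported figure.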

The confusion matrix from the test result is shown below.

The test was conducted using the threshold tuned on the validation dataset. The performance of the model at various thresholds can be seen in the graph below; the threshold that yields the best performance is around 0.342.
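
A threshold sweep of this kind can be sketched as follows. The example assumes that, for each validation image, the similarity to its nearest database entry and that entry's label have already been computed. It selects by accuracy, which is an assumption (the text does not say which metric was used), and the data is made up.

```python
import numpy as np


def best_threshold(best_sims, nn_labels, true_labels, thresholds):
    """Return the (threshold, accuracy) pair with the highest validation accuracy."""
    best_sims = np.asarray(best_sims)
    nn_labels = np.asarray(nn_labels)
    true_labels = np.asarray(true_labels)
    scores = []
    for t in thresholds:
        # Below the threshold the nearest neighbor is rejected as 'others'.
        predicted = np.where(best_sims < t, "others", nn_labels)
        scores.append(float(np.mean(predicted == true_labels)))
    best = int(np.argmax(scores))
    return float(thresholds[best]), scores[best]


# Made-up validation data: two members and two non-members.
sims = [0.9, 0.8, 0.3, 0.2]
nearest = ["Clarence", "Eric Lee", "Clarence", "Eric Lee"]
truth = ["Clarence", "Eric Lee", "others", "others"]

print(best_threshold(sims, nearest, truth, thresholds=[0.1, 0.5, 0.95]))
```

With this toy data, a threshold of 0.5 separates the members from the non-members perfectly, while 0.1 accepts everyone and 0.95 rejects everyone.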

5. Authors and Acknowledgements

The authors would like to thank our mentor, Lee Ping, for the technical suggestions and input on the implementation of this project.

Authors:

Eric Kwok (Backend Face Identification)
Eric Lee (Implementation of PeekingDuck Framework)
Clarence Lam (Frontend Web Interface)
