Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Last update: Dec 02, 2022

Related tags

Deep Learning LMFD-PAD

Overview

LMFD-PAD

Note

This is the official repository of the paper: LMFD-PAD: Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection. The paper can be found in here.

Pipeline Overview

Data preparation

Since the data in all used PAD datasets in our work are videos, we sample 10 frames in the average time interval of each video. In addition, the ratio of bona fide and attack is balanced by simple duplication. Finally, CSV files are generated for further training and evaluation. The format of the dataset CSV file is:

image_path,label
/image_dir/image_file_1.png, bonafide
/image_dir/image_file_2.png, bonafide
/image_dir/image_file_3.png, attack
/image_dir/image_file_4.png, attack

Training

The training code for intra-dataset and cross-dataset experiments is same, the difference code between intra_db_main.py and cross_db_main.py is evaluation metrics.

Example of intra-dataset training and testing:

python intra_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note' \

Example of cross-dataset training and testing is similar:

python cross_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note' \

Results

The results of cross-dataset evaluation under different experimental settings on four face PAD datasets. More details can be found in paper.

Models

Four models pre-trained based on four cross-dataset experimental settings can be download via google driver.

if you use LMFD-HAM architecture in this repository, please cite the following paper:

@misc{fang2021learnable,
    title={Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection},
    author={Meiling Fang and Naser Damer and Florian Kirchbuchner and Arjan Kuijper},
    year={2021},
    eprint={2109.07950},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Related tags

Overview

LMFD-PAD

Note

Pipeline Overview

Data preparation

Training

Results

Models

Owner

Spectralformer: Rethinking hyperspectral image classification with transformers

A texturizer that I just made. Nothing special here.

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

This repository contains the code for the binaural-detection model used in the publication arXiv:2111.04637

Some toy examples of score matching algorithms written in PyTorch

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals, CVPR2021

Unofficial Implementation of Oboe (SIGCOMM'18').

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Mesh TensorFlow: Model Parallelism Made Easier

Notspot robot simulation - Python version

An Image compression simulator that uses Source Extractor and Monte Carlo methods to examine the post compressive effects different compression algorithms have.

Fast (simple) spectral synthesis and emission-line fitting of DESI spectra.

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

RefineMask (CVPR 2021)

3D cascade RCNN for object detection on point cloud

Implementation of hyperparameter optimization/tuning methods for machine learning & deep learning models

PyExplainer: A Local Rule-Based Model-Agnostic Technique (Explainable AI)

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

Wanli Li and Tieyun Qian: Exploit a Multi-head Reference Graph for Semi-supervised Relation Extraction, IJCNN 2021

LoFTR:Detector-Free Local Feature Matching with Transformers CVPR 2021