Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Last update: Dec 02, 2022

Overview

LMFD-PAD

Note

This is the official repository of the paper: LMFD-PAD: Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection. The paper is available on arXiv (2109.07950).

Pipeline Overview

Data preparation

Since all PAD datasets used in our work consist of videos, we sample 10 frames at evenly spaced time intervals from each video. In addition, the ratio of bona fide to attack samples is balanced by simple duplication. Finally, CSV files are generated for training and evaluation. The format of the dataset CSV file is:

image_path,label
/image_dir/image_file_1.png, bonafide
/image_dir/image_file_2.png, bonafide
/image_dir/image_file_3.png, attack
/image_dir/image_file_4.png, attack
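
The preparation steps above can be sketched as follows. This is a minimal illustration, not the repository's actual preprocessing script: the function names `sample_frame_indices`, `balance_by_duplication`, and `write_protocol_csv` are hypothetical, picking the middle frame of each of 10 equal segments is one possible reading of "evenly spaced", and decoding the video frames themselves (for example with OpenCV) is left out.

```python
import csv
import random
from collections import Counter


def sample_frame_indices(total_frames, num_samples=10):
    """Return up to num_samples frame indices spread evenly over a video."""
    if total_frames <= 0:
        return []
    if total_frames <= num_samples:
        return list(range(total_frames))
    step = total_frames / num_samples
    # Assumption: take the middle frame of each equal-length segment.
    return [int(step * i + step / 2) for i in range(num_samples)]


def balance_by_duplication(rows, seed=0):
    """Duplicate randomly chosen minority-class rows until labels are balanced.

    rows is a list of (image_path, label) tuples.
    """
    rng = random.Random(seed)
    counts = Counter(label for _, label in rows)
    if len(counts) < 2:
        return list(rows)
    target = max(counts.values())
    balanced = list(rows)
    for label, count in counts.items():
        pool = [row for row in rows if row[1] == label]
        balanced.extend(rng.choice(pool) for _ in range(target - count))
    return balanced


def write_protocol_csv(rows, csv_path):
    """Write (image_path, label) rows with the header shown above."""
    with open(csv_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["image_path", "label"])
        writer.writerows(rows)
```

For a 100-frame video, `sample_frame_indices(100)` yields the indices 5, 15, ..., 95. The extracted frames would then be saved as images, paired with their labels, passed through `balance_by_duplication`, and written out with `write_protocol_csv`.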

Training

The training code for intra-dataset and cross-dataset experiments is the same; the only difference between intra_db_main.py and cross_db_main.py is the evaluation metrics.
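
This README does not name the metrics each script reports. Purely as an illustration, the sketch below computes the Half Total Error Rate (HTER), a metric commonly used in cross-dataset face PAD evaluation. The function name `hter`, the label encoding (1 = bona fide, 0 = attack), and the convention that a higher score means "more likely bona fide" are all assumptions and are not taken from the repository code.

```python
def hter(scores, labels, threshold):
    """Half Total Error Rate at a fixed decision threshold.

    Assumptions: labels use 1 for bona fide and 0 for attack, and a sample
    is accepted as bona fide when its score is >= threshold.
    """
    attack_scores = [s for s, y in zip(scores, labels) if y == 0]
    bona_fide_scores = [s for s, y in zip(scores, labels) if y == 1]
    if not attack_scores or not bona_fide_scores:
        raise ValueError("both classes are required to compute HTER")
    # False acceptance rate: attacks wrongly accepted as bona fide.
    far = sum(s >= threshold for s in attack_scores) / len(attack_scores)
    # False rejection rate: bona fide samples wrongly rejected.
    frr = sum(s < threshold for s in bona_fide_scores) / len(bona_fide_scores)
    return (far + frr) / 2
```

In a cross-dataset setting the threshold is typically fixed beforehand on held-out data and then applied unchanged to the unseen target dataset.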

Example of intra-dataset training and testing:

python intra_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note'

Example of cross-dataset training and testing is similar:

python cross_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note'

Results

Results of the cross-dataset evaluation under different experimental settings on four face PAD datasets. More details can be found in the paper.

Models

Four models, each pre-trained under one of the four cross-dataset experimental settings, can be downloaded via Google Drive.

If you use the LMFD-HAM architecture in this repository, please cite the following paper:

@misc{fang2021learnable,
    title={Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection},
    author={Meiling Fang and Naser Damer and Florian Kirchbuchner and Arjan Kuijper},
    year={2021},
    eprint={2109.07950},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}
