YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)

Last update: Jan 07, 2023

Related tags

Deep Learning yolov5-face

Overview

Introduction

Yolov5-face is a real-time,high accuracy face detection.

Performance

Single Scale Inference on VGA resolution（max side is equal to 640 and scale).

Large family

Method	Backbone	Easy	Medium	Hard	#Params(M)	#Flops(G)
DSFD (CVPR19)	ResNet152	94.29	91.47	71.39	120.06	259.55
RetinaFace (CVPR20)	ResNet50	94.92	91.90	64.17	29.50	37.59
HAMBox (CVPR20)	ResNet50	95.27	93.76	76.75	30.24	43.28
TinaFace (Arxiv20)	ResNet50	95.61	94.25	81.43	37.98	172.95
SCRFD-34GF(Arxiv21)	Bottleneck Res	96.06	94.92	85.29	9.80	34.13
SCRFD-10GF(Arxiv21)	Basic Res	95.16	93.87	83.05	3.86	9.98
-	-	-	-	-	-	-
YOLOv5s	CSPNet	94.67	92.75	83.03	7.075	5.751
YOLOv5s6	CSPNet	95.48	93.66	82.8	12.386	6.280
YOLOv5m	CSPNet	95.30	93.76	85.28	21.063	18.146
YOLOv5m6	CSPNet	95.66	94.1	85.2	35.485	19.773
YOLOv5l	CSPNet	95.78	94.30	86.13	46.627	41.607
YOLOv5l6	CSPNet	96.38	94.90	85.88	76.674	45.279

Small family

Method	Backbone	Easy	Medium	Hard	#Params(M)	#Flops(G)
RetinaFace (CVPR20	MobileNet0.25	87.78	81.16	47.32	0.44	0.802
FaceBoxes (IJCB17)		76.17	57.17	24.18	1.01	0.275
SCRFD-0.5GF(Arxiv21)	Depth-wise Conv	90.57	88.12	68.51	0.57	0.508
SCRFD-2.5GF(Arxiv21)	Basic Res	93.78	92.16	77.87	0.67	2.53
-	-	-	-	-	-	-
YOLOv5n	ShuffleNetv2	93.74	91.54	80.32	1.726	2.111
YOLOv5n-0.5	ShuffleNetv2	90.76	88.12	73.82	0.447	0.571

Pretrained-Models

Name	Easy	Medium	Hard	FLOPs(G)	Params(M)	Link
yolov5n-0.5	90.76	88.12	73.82	0.571	0.447	Link: https://pan.baidu.com/s/1UgiKwzFq5NXI2y-Zui1kiA pwd: s5ow, https://drive.google.com/file/d/1XJ8w55Y9Po7Y5WP4X1Kg1a77ok2tL_KY/view?usp=sharing
yolov5n	93.61	91.52	80.53	2.111	1.726	Link: https://pan.baidu.com/s/1xsYns6cyB84aPDgXB7sNDQ pwd: lw9j,https://drive.google.com/file/d/18oenL6tjFkdR1f5IgpYeQfDFqU4w3jEr/view?usp=sharing
yolov5s	94.33	92.61	83.15	5.751	7.075	Link: https://pan.baidu.com/s/1fyzLxZYx7Ja1_PCIWRhxbw Link: eq0q,https://drive.google.com/file/d/1zxaHeLDyID9YU4-hqK7KNepXIwbTkRIO/view?usp=sharing
yolov5m	95.30	93.76	85.28	18.146	21.063	Link: https://pan.baidu.com/s/1oePvd2K6R4-gT0g7EERmdQ pwd: jmtk
yolov5l	95.78	94.30	86.13	41.607	46.627	Link: https://pan.baidu.com/s/11l4qSEgA2-c7e8lpRt8iFw pwd: 0mq7

Data preparation

Download WIDERFace datasets.
Download annotation files from google drive.

python3 train2yolo.py
python3 val2yolo.py

Training

CUDA_VISIBLE_DEVICES="0,1,2,3" python3 train.py --data data/widerface.yaml --cfg models/yolov5s.yaml --weights 'pretrained models'

WIDERFace Evaluation

python3 test_widerface.py --weights 'your test model' --img-size 640

cd widerface_evaluate
python3 evaluation.py

Test

Android demo

https://github.com/FeiGeChuanShu/ncnn_Android_face/tree/main/ncnn-android-yolov5_face

References

https://github.com/ultralytics/yolov5

https://github.com/DayBreak-u/yolo-face-with-landmark

https://github.com/xialuxi/yolov5_face_landmark

https://github.com/biubug6/Pytorch_Retinaface

https://github.com/deepinsight/insightface

Citation

If you think this work is useful for you, please cite

@article{YOLO5Face,
title = {YOLO5Face: Why Reinventing a Face Detector},
author = {Delong Qi and Weijun Tan and Qi Yao and Jingfeng Liu},
booktitle = {ArXiv preprint ArXiv:2105.12931},
year = {2021}
}

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)

Related tags

Overview

Introduction

Performance

Pretrained-Models

Data preparation

Training

WIDERFace Evaluation

Test

Android demo

References

Citation

Owner

DeepCam Shenzhen

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

git git《Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking》(CVPR 2021) GitHub:git2] 《Masksembles for Uncertainty Estimation》(CVPR 2021) GitHub:git3]

PyTorch Kafka Dataset: A definition of a dataset to get training data from Kafka.

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Fully Convolutional DenseNets for semantic segmentation.

TJU Deep Learning & Neural Network

Simulated garment dataset for virtual try-on

Physics-Informed Neural Networks (PINN) and Deep BSDE Solvers of Differential Equations for Scientific Machine Learning (SciML) accelerated simulation

Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)

CSAC - Collaborative Semantic Aggregation and Calibration for Separated Domain Generalization

Python lib to talk to pylontech lithium batteries (US2000, US3000, ...) using RS485

The first public PyTorch implementation of Attentive Recurrent Comparators

This is a computer vision based implementation of the popular childhood game 'Hand Cricket/Odd or Even' in python

PyKaldi GOP-DNN on Epa-DB

Interactive Visualization to empower domain experts to align ML model behaviors with their knowledge.

The official code of Anisotropic Stroke Control for Multiple Artists Style Transfer

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning using 🤗 transformers

Lab course materials for IEMBA 8/9 course "Coding and Artificial Intelligence"

The UI as a mobile display for OP25

An LSTM based GAN for Human motion synthesis