Unified learning approach for egocentric hand gesture recognition and fingertip detection

Last update: Dec 25, 2022

Overview

Unified Gesture Recognition and Fingertip Detection

A unified convolutional neural network (CNN) algorithm for both hand gesture recognition and fingertip detection at the same time. The proposed algorithm uses a single network to predict both finger class probabilities for classification and fingertips positional output for regression in one evaluation. From the finger class probabilities, the gesture is recognized, and using both of the information fingertips are localized. Instead of directly regressing the fingertips position from the fully connected (FC) layer of the CNN, we regress the ensemble of fingertips position from a fully convolutional network (FCN) and subsequently take ensemble average to regress the final fingertips positional output.

Update

Included robust real-time hand detection using yolo for better smooth performance in the first stage of the detection system and most of the code has been cleaned and restructured for ease of use. To get the previous versions, please visit the release section.

Requirements

TensorFlow-GPU==2.2.0
OpenCV==4.2.0
ImgAug==0.2.6
Weights: Download the pre-trained weights files of the unified gesture recognition and fingertip detection model and put the weights folder in the working directory.

The weights folder contains three weights files. The fingertip.h5 is for unified gesture recignition and finertiop detection. yolo.h5 and solo.h5 are for the yolo and solo method of hand detection. (what is solo?)

Paper

To get more information about the proposed method and experiments, please go through the paper. Cite the paper as:

@article{alam2021unified,
title = {Unified learning approach for egocentric hand gesture recognition and fingertip detection},
author={Alam, Mohammad Mahmudul and Islam, Mohammad Tariqul and Rahman, SM Mahbubur},
journal = {Pattern Recognition},
volume = {121},
pages = {108200},
year = {2021},
publisher={Elsevier},
}

Dataset

The proposed gesture recognition and fingertip detection model is trained by employing Scut-Ego-Gesture Dataset which has a total of eleven different single hand gesture datasets. Among the eleven different gesture datasets, eight of them are considered for experimentation. A detailed explanation about the partition of the dataset along with the list of the images used in the training, validation, and the test set is provided in the dataset/ folder.

Network Architecture

To implement the algorithm, the following network architecture is proposed where a single CNN is utilized for both hand gesture recognition and fingertip detection.

Prediction

To get the prediction on a single image run the predict.py file. It will run the prediction in the sample image stored in the data/ folder. Here is the output for the sample.jpg image.

Real-Time!

To run in real-time simply clone the repository and download the weights file and then run the real-time.py file.

directory > python real-time.py

In real-time execution, there are two stages. In the first stage, the hand can be detected by using either you only look once (yolo) or single object localization (solo) algorithm. By default, yolo will be used here. The detected hand portion is then cropped and fed to the second stage for gesture recognition and fingertip detection.

Output

Here is the output of the unified gesture recognition and fingertip detection model for all of the 8 classes of the dataset where not only each fingertip is detected but also each finger is classified.

Comments

Datasets

Hello， I have a question about the dataset from your readme, I can't download the Scut-Ego-Gesture Dataset ，Because in China, this website has been banned. Can you share it with me in other ways? For example, Google or QQ email: [email protected]

opened by CVUsers 10
how to download the weights, code not contain?

The weights folder contains three weights files. The comparison.h5 is for first five classes and performance.h5 is for first eight classes. solo.h5 is for hand detection. but no link

opened by mmxuan18 6
OSError: Unable to open file (unable to open file: name = 'yolo.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)

I use the Mac Os to run thereal-time.py file, and get the OSError, I also search on Google to find others' the same problem. It is probably the Keras problem. But I do not how to solve it

opened by Hanswanglin 4
OSError: Unable to open file (unable to open file: name = 'weights/performance.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)

File "h5py/_objects.pyx", line 54, in h5py._objects.with_phil.wrapper File "h5py/_objects.pyx", line 55, in h5py._objects.with_phil.wrapper File "h5py/h5f.pyx", line 88, in h5py.h5f.open OSError: Unable to open file (unable to open file: name = 'weights/performance.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)

opened by Jasonmes 2
left hand?

Hi, first it's really cool work!

Is the left hand included in the training images? I have been playing around with some of my own images and it seems that it doesn't really recognize the left hand in a palm-down position...

If I want to include the left hand, do you think it would be possible if I train the network with the image flipped?

opened by myhjiang 1
why are there two hand detection provided?

A wonderful work!!As mentioned above, the Yolo and Solo detection models are provided. I wonder what is the advatange of each model comparing to the other and what is the dataset to train the detect.

opened by DanielMao2015 1
Difference of classes5.h5 and classes8.h5

Hi, May i know the difference when training classes5 and classes8? are the difference from the dataset used for training by excluding SingleSix, SingleSeven, SingleEight or there are other modification such as changing the model structure or parameters?

Thanks

opened by danieltanimanuel 1

Using old versions of tensorflow, can't install the dependencies on my macbook and with newer versions it's constatly failing.

When trying to install the required version of tensorflow:

pip3 install tensorflow==1.15.0
ERROR: Could not find a version that satisfies the requirement tensorflow==1.15.0 (from versions: 2.2.0rc3, 2.2.0rc4, 2.2.0, 2.2.1, 2.2.2, 2.3.0rc0, 2.3.0rc1, 2.3.0rc2, 2.3.0, 2.3.1, 2.3.2, 2.4.0rc0, 2.4.0rc1, 2.4.0rc2, 2.4.0rc3, 2.4.0rc4, 2.4.0, 2.4.1)
ERROR: No matching distribution found for tensorflow==1.15.0

I even tried downloading the .whl file from the pypi and try manually installing it, but that didn't work too:

pip3 install ~/Downloads/tensorflow-1.15.0-cp37-cp37m-macosx_10_11_x86_64.whl
ERROR: tensorflow-1.15.0-cp37-cp37m-macosx_10_11_x86_64.whl is not a supported wheel on this platform.

Tried with both python3.6 and python3.8

So it would be great to update the dependencies :)

opened by KoStard 1

Custom Model keyword arguments Error

Change model = Model(input=model.input, outputs=[probability, position]) to model = Model(inputs=model.input, outputs=[probability, position]) on line 22 of net/network.py

opened by Rohit-Jain-2801 1
Problem of weights

Hi,when load the solo.h5(In solo.py line 14:"self.model.load_weights(weights)") it will report errors: Process finished with exit code -1073741819 (0xC0000005) keras2.2.5+tensorflow1.14.0+cuda10.0

opened by MC-E 1

Releases(v2.0)

v2.0(Aug 2, 2021)

Latest and updated version of the unified hand gesture recognition and fingertip detection. Most of the code has been cleaned and restructured for ease of use. Moreover, SOLO and YOLO hand detectors have been added.
Source code(tar.gz)
Source code(zip)
v1.0(Mar 25, 2020)

Source code(tar.gz)
Source code(zip)

Owner

Mohammad

Machine Learning | Graduate Research Assistant at CORAL Lab

GitHub Repository https://www.sciencedirect.com/science/article/pii/S0031320321003824

PyTorch implementation of adversarial patch

adversarial-patch PyTorch implementation of adversarial patch This is an implementation of the Adversarial Patch paper. Not official and likely to hav

172 Nov 29, 2022

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

A selection of State Of The Art research papers (and code) on human trajectory prediction (forecasting). Papers marked with [W] are workshop papers.

40 Nov 18, 2022

Parameterising Simulated Annealing for the Travelling Salesman Problem

55 Jun 15, 2022

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

FSRA This repository contains the dataset link and the code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV

32 Dec 18, 2022

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks This repository implements a capsule model Inten

15 Dec 24, 2022

A high performance implementation of HDBSCAN clustering.

HDBSCAN HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates

2.3k Jan 02, 2023

Python scripts using the Mediapipe models for Halloween.

Mediapipe-Halloween-Examples Python scripts using the Mediapipe models for Halloween. WHY Mainly for fun. But this repository also includes useful exa

23 Jan 06, 2023

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Don’t be Contradicted with Anything!CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System This repository contains the PyTorch im

25 Sep 06, 2022

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Introduction Pytorch implementation of Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Expert. | paper Song Park1

97 Dec 23, 2022

Unified learning approach for egocentric hand gesture recognition and fingertip detection

Related tags

Overview

Unified Gesture Recognition and Fingertip Detection

Update

Requirements

Paper

Dataset

Network Architecture

Prediction

Real-Time!

Output

Comments

Releases(v2.0)

v2.0(Aug 2, 2021)

v1.0(Mar 25, 2020)

Owner

Mohammad

PyTorch implementation of adversarial patch

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

Parameterising Simulated Annealing for the Travelling Salesman Problem

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

A high performance implementation of HDBSCAN clustering.

Python scripts using the Mediapipe models for Halloween.

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

A more easy-to-use implementation of KPConv

GANsformer: Generative Adversarial Transformers Drew A

Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Code for the paper: "On the Bottleneck of Graph Neural Networks and Its Practical Implications"

Spectral Temporal Graph Neural Network (StemGNN in short) for Multivariate Time-series Forecasting

Code for paper [ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot] (ICCV 2021, oral))

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

A deep neural networks for images using CNN algorithm.

PyTorch-based framework for Deep Hedging

CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images

(AAAI 2021) Progressive One-shot Human Parsing

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)