Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Last update: Dec 22, 2022

Related tags

Overview

Shuwa Gesture Toolkit

Shuwa (手話) is Japanese for "Sign Language"

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos. It is particularly useful for recognizing basic words in sign language. We collected thousands of example videos of people signing Japanese Sign Language (JSL) and Hong Kong Sign Language (HKSL) to train the baseline model for recognizing gestures and facial expressions.

The Shuwa Gesture Toolkit also allows you to train new gestures, so it can be trained to recognize any sign from any sign language in the world.

[Web Demo]

How it works

By combining pose, face, and hand detector results over multiple frames we can acquire a fairly requirement for sign language understanding includes body movement, facial movement, and hand gesture. After that we use DD-Net as a recognitor to predict sign features represented in the 832D vector. Finally using use K-Nearest Neighbor classification to output the class prediction.

All related models listed below.

PoseNet: Pose detector model.
FaceMesh : Face keypoints detector model.
HandLandmarks : Hand keypoints detector model.
DD-Net : Skeleton-based action recognition model.

Installation

For MacOS user
Install python 3.7 from official python.org for tkinter support.
Install dependencies
```
pip3 install -r requirements.txt 
```

Run Python Demo

python3 webcam_demo_knn.py

Use record mode to add more sign.
Play mode.

Run Detector demo

You can try each detector individually by using these scripts.

FaceMesh

python3 face_landmark\webcam_demo_face.py

PoseNet

python3 posenet\webcam_demo_pose.py

HandLandmarks

python3 hand_landmark\webcam_demo_hand.py

Deploy on the Web using Tensorflow.js

Instructions here

Train classifier from scratch

You can add a custom sign by using Record mode in the full demo program.
But if you want to train the classifier from scratch you can check out the process here

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Related tags

Overview

Shuwa Gesture Toolkit

How it works

Installation

Run Python Demo

Run Detector demo

Deploy on the Web using Tensorflow.js

Train classifier from scratch

Owner

Google

Dual Attention Network for Scene Segmentation (CVPR2019)

The repository contains source code and models to use PixelNet architecture used for various pixel-level tasks. More details can be accessed at .

ReferFormer - Official Implementation of ReferFormer

Create Data & AI apps in 20 lines of code with Shimoku

Implementation of Axial attention - attending to multi-dimensional data efficiently

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

OntoProtein: Protein Pretraining With Ontology Embedding

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

Awesome Human Pose Estimation

Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

Type4Py: Deep Similarity Learning-Based Type Inference for Python

Pure python implementation reverse-mode automatic differentiation

Code & Data for Enhancing Photorealism Enhancement

This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021

Compares various time-series feature sets on computational performance, within-set structure, and between-set relationships.

List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Related tags

Overview

Shuwa Gesture Toolkit

How it works

Installation

Run Python Demo

Run Detector demo

Deploy on the Web using Tensorflow.js

Train classifier from scratch

Owner

Google

Dual Attention Network for Scene Segmentation (CVPR2019)

The repository contains source code and models to use PixelNet architecture used for various pixel-level tasks. More details can be accessed at .

ReferFormer - Official Implementation of ReferFormer

Create Data & AI apps in 20 lines of code with Shimoku

Implementation of Axial attention - attending to multi-dimensional data efficiently

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

OntoProtein: Protein Pretraining With Ontology Embedding

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

Awesome Human Pose Estimation

Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

Type4Py: Deep Similarity Learning-Based Type Inference for Python

Pure python implementation reverse-mode automatic differentiation

Code & Data for Enhancing Photorealism Enhancement

This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021

Compares various time-series feature sets on computational performance, within-set structure, and between-set relationships.

List some popular DeepFake models e.g. DeepFake, FaceSwap-MarekKowal, IPGAN, FaceShifter, FaceSwap-Nirkin, FSGAN, SimSwap, CihaNet, etc.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务