Sign Language is detected in realtime using video sequences. Our approach involves MediaPipe Holistic for keypoints extraction and LSTM Model for prediction.

Last update: Aug 20, 2022

Related tags

Deep Learning Pose-Network

Overview

RealTime Sign Language Detection using Action Recognition

Approach

Real-Time Sign Language is commonly predicted using models whose architecture consists of multiple CNN layers followed by multiple LSTM layers. However , the accuracy of these state of the art models is pretty low. On the other hand, this approach , Mediapipe Holistic with LSTM Model gives a much better accuracy. This approach produced better results with very less amount of data . Since this model trained on fewer parameters, it trained much faster thus resulting in lesser computation time.

Project

This project is divided into two parts:

Keypoints extraction using MediaPipe Holistic
LSTM Model trained on these keypoints to predict realtime sign language using video sequences.

Dataset

Data is collected using MediaPipe Holistic for 3 actions :

Hello
Thanks
I Love You

30 frames have been collected for each action and 30 sequences for each frame have been collected from real time actions using Computer Vision and MediaPipe Holistic. For each sequence , 1662 keypoints have been extracted.

Face Landmarks - 468*3
Pose Landmarks - 33*4
Left Hand Landmarks - 21*3
Right Hand Landmarks - 21*3

The dataset can be accessed from the Feature_Extraction Folder.

Model

LSTM Model is trained using the extracted keypoints from the Feature_Extraction folder and later used for real time predictions.

The Weights of the model are saved in the lstm_model.h5 file.

How to Use

Clone the repository using :

  $ git clone https://github.com/rishusiva/Pose-Network

Install the requirements using:

  $ cd Pose-Network/
  $ pip install -r requirements.txt

To Predict Sign Languages in Real Time , run :

  $ cd Pose-Network/Code
  $ python3 realtime_testing.py

Results

Our LSTM Model, after training for only 100 epochs, has an accuracy of 70%
It produced an accuracy score of 1.0 on a test set of 5 images.
Our Trained LSTM Model is then used for real time testing.

Prediction Results:

Author

Rishikesh Sivakumar

by Rishikesh Sivakumar

Sign Language is detected in realtime using video sequences. Our approach involves MediaPipe Holistic for keypoints extraction and LSTM Model for prediction.

Related tags

Overview

RealTime Sign Language Detection using Action Recognition

Approach

Project

Dataset

Model

How to Use

Results

Prediction Results:

Author

Owner

Rishikesh S

Bravia core script for python

《Rethinking Sptil Dimensions of Vision Trnsformers》(2021)

This project is based on our SIGGRAPH 2021 paper, ROSEFusion: Random Optimization for Online DenSE Reconstruction under Fast Camera Motion .

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

A python library to build Model Trees with Linear Models at the leaves.

This is the official code for the paper "Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision"

CC-GENERATOR - A python script for generating CC

Run containerized, rootless applications with podman

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

OpenMMLab Detection Toolbox and Benchmark

Planner_backend - Academic planner application designed for students and counselors.

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

The source codes for TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation.

Sharpened cosine similarity torch - A Sharpened Cosine Similarity layer for PyTorch

Colour detection is necessary to recognize objects, it is also used as a tool in various image editing and drawing apps.

Joint Detection and Identification Feature Learning for Person Search

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

This repository contains the source code of Auto-Lambda and baselines from the paper, Auto-Lambda: Disentangling Dynamic Task Relationships.

Multi agent DDPG algorithm written in Python + Pytorch

Open-source implementation of Google Vizier for hyper parameters tuning