Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Last update: Dec 18, 2022

Overview

QuickDraw - AirGesture

Introduction

Here is my python source code for QuickDraw - an online game developed by google, combined with AirGesture - a simple gesture recognition application. By using my code, you could:

Run an app which you could draw in front of a camera with your hand (If you use laptop, your webcam will be used by default)
Run an app which you could draw on a canvas

Camera app

In order to use this application, you only need to use your hand to draw in front of a camera/webcam. The middle point of your hand will be detected and highlighted by a red dot. When you are ready for drawing, you need to press space button to start drawing. When you want to stop drawing, press space button again. Below is the demo by running the sript camera_app.py:

Camera app demo

Drawing app

The script and demo will be released soon

Categories:

The table below shows 18 categories my model used:


apple	book	bowtie	candle
cloud	cup	door	envelope
eyeglasses	hammer	hat	ice cream
leaf	scissors	star	t-shirt
pants	tree

Trained models

You could find my trained model at data/trained_models/

Docker

For being convenient, I provide Dockerfile which could be used for running training phase as well as launching application

Assume that docker image's name is qd_ag. You already clone this repository and cd into it.

Build:

sudo docker build --network=host -t qd_ag .

Run:

If you want to launch the application, first you need to run xhost + to turn off access control (if you only want to run the training, you could skip this step). Then you run:

sudo docker run --gpus all -it --rm --volume="path/to/your/data:/workspace/code/data -e DISPLAY=$DISPLAY --env="QT_X11_NO_MITSHM=1" -v /tmp/.X11-unix:/tmp/.X11-unix --device=/dev/video0:/dev/video0 qd_ag

Inside docker container, you could run train.py or camera_app.py scripts for training or launching app respectively. By default, the camera_app.py script will automatically generate a video capturing what you have done during the session, at data/output.mp4

Experiments:

For each class, I split the data to training and test sets with ratio of 8:2. The training/test loss/accuracy curves for the experiment are shown below:

Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Related tags

Overview

QuickDraw - AirGesture

Introduction

Camera app

Drawing app

Categories:

Trained models

Docker

Experiments:

Owner

Viet Nguyen

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning

Label-Free Model Evaluation with Semi-Structured Dataset Representations

Toolbox to analyze temporal context invariance of deep neural networks

On Effective Scheduling of Model-based Reinforcement Learning

Pytorch implementation code for [Neural Architecture Search for Spiking Neural Networks]

Implementation of hyperparameter optimization/tuning methods for machine learning & deep learning models

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

[CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

PyGCL: A PyTorch Library for Graph Contrastive Learning

Implementation of "Bidirectional Projection Network for Cross Dimension Scene Understanding" CVPR 2021 (Oral)

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Cryptocurrency Prediction with Artificial Intelligence (Deep Learning via LSTM Neural Networks)

Turning SymPy expressions into JAX functions

Exploration & Research into cross-domain MEV. Initial focus on ETH/POLYGON.

Code from the paper "High-Performance Brain-to-Text Communication via Handwriting"

Implementation of a Transformer, but completely in Triton

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Reading list for research topics in Masked Image Modeling