Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,

Last update: Oct 10, 2022

Related tags

Overview

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,
Linh Van Ma, Tin Trung Tran, Moongu Jeon, ICAIIC 2022 (The 4th International Conference on Artificial Intelligence in Information and Communication February 21 (Mon.) ~ 24 (Thur.), 2022, Guam, USA & Virtual Conference)

Gaze Estimation, Jetson Board Tx2, Realsense d435i Camera, Demo Video

How to run?

If you want to finetune this deep learning model. You first need to collect your dataset. You need to look at the center of each rectangle (36 rectangles).

python3 collect_dataset.py

Once you finish collecting your dataset. You need to change the folder of subject in run_finetune.py. Then, you can start finetuning this deep learning model.

python3 run_finetune.py

Remember to rebuild TensorRT if you first run this source in your device. You need to move your working folder to ext\tensorrt_mtcnn.

chmod +x ./build.sh
./build.sh

You now can run to test this gaze estimation by first connect a realsense camera to Jetson TX2. Run the following script.

python3 run_camera.py

To test with your recorded video, you should specify you video location in run_camera_test.py. Run the following script.

python3 run_camera_test.py

Dependencies

FAZE: Few-Shot Adaptive Gaze Estimation: https://github.com/NVlabs/few_shot_gaze
eos: https://github.com/patrikhuber/eos
HRNets: https://github.com/HRNet/HRNet-Facial-Landmark-Detection
mtcnn-pytorch: https://github.com/TropComplique/mtcnn-pytorch
Realtime-facial-landmark-detection: https://github.com/pathak-ashutosh/Realtime-facial-landmark-detection
MTCNN TensorRT(Demo #2: MTCNN): https://github.com/jkjung-avt/tensorrt_demos#mtcnn

5.1 TensorRT MTCNN Face Detector

5.2 Optimizing TensorRT MTCNN

Acknowledgement

A large part of the code is borrowed from FAZE: Few-Shot Adaptive Gaze Estimation and MTCNN TensorRT(Demo #2: MTCNN). Thanks for their wonderful works.

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,

Related tags

Overview

Gaze Estimation, Jetson Board Tx2, Realsense d435i Camera, Demo Video

How to run?

Dependencies

Acknowledgement

Owner

Linh

最新版本yolov5+deepsort目标检测和追踪，支持5.0版本可训练自己数据集

Training Cifar-10 Classifier Using VGG16

Deep Learning pipeline for motor-imagery classification.

This repository comes with the paper "On the Robustness of Counterfactual Explanations to Adverse Perturbations"

MazeRL is an application oriented Deep Reinforcement Learning (RL) framework

This repository contains an implementation of ConvMixer for the ICLR 2022 submission "Patches Are All You Need?".

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

This repository attempts to replicate the SqueezeNet architecture and implement the same on an image classification task.

PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.

Official code for "Decoupling Zero-Shot Semantic Segmentation"

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

Encode and decode text application

People Interaction Graph

I will implement Fastai in each projects present in this repository.

Unofficial PyTorch implementation of SimCLR by Google Brain

Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Imaging, analysis, and simulation software for radio interferometry

《K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters》(2020)

Weakly Supervised Segmentation with Tensorflow. Implements instance segmentation as described in Simple Does It: Weakly Supervised Instance and Semantic Segmentation, by Khoreva et al. (CVPR 2017).