Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

Last update: Nov 02, 2022

Related tags

Overview

Do pedestrians pay attention? Eye contact detection for autonomous driving

Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

Image taken from : https://jooinn.com/people-walking-on-pedestrian-lane-during-daytime.html . Results obtained with the model trained on JackRabbot, Nuscenes, JAAD and Kitti. The model file is available at models/predictor and can be reused for testing with the predictor.

Abstract

In urban or crowded environments, humans rely on eye contact for fast and efficient communication with nearby people. Autonomous agents also need to detect eye contact to interact with pedestrians and safely navigate around them. In this paper, we focus on eye contact detection in the wild, i.e., real-world scenarios for autonomous vehicles with no control over the environment or the distance of pedestrians. We introduce a model that leverages semantic keypoints to detect eye contact and show that this high-level representation (i) achieves state-of-the-art results on the publicly-available dataset JAAD, and (ii) conveys better generalization properties than leveraging raw images in an end-to-end network. To study domain adaptation, we create LOOK: a large-scale dataset for eye contact detection in the wild, which focuses on diverse and unconstrained scenarios for real-world generalization. The source code and the LOOK dataset are publicly shared towards an open science mission.

Requirements
Predictor
- Example command
Create the datasets for training and evaluation
Training your models on LOOK / JAAD / PIE
Evaluate your trained models
Annotate new images
Cite our work

Requirements

Use 3.6.9 <= python < 3.9. Run pip3 install -r requirements.txt to get the dependencies

Predictor

Get predictions from our pretrained model using any image with the predictor. The scripts extracts the human keypoints on the fly using OpenPifPaf. The predictor supports eye contact detection using human keypoints only. You need to specify the following arguments in order to run correctly the script:

Parameter	Description
`--glob`	Glob expression to be used. Example: `.png`
`--images`	Path to the input images. If glob is enabled you need the path to the directory where you have the query images
`--looking_threshold`	Threshold to define an eye contact. Default `0.5`
`--transparency`	Transparency of the output poses. Default `0.4`

Example command:

If you want to reproduce the result of the top image, run:

If you want to run the predictor on a GPU:

python predict.py --images images/people-walking-on-pedestrian-lane-during-daytime-3.jpg

If you want to run the predictor on a CPU:

python predict.py --images images/people-walking-on-pedestrian-lane-during-daytime-3.jpg --device cpu --disable-cuda

Create the datasets for training and evaluation

Please follow the instructions on the folder create_data.

Training your models on LOOK / JAAD / PIE

You have one config file to modify. Do not change the variables name. Check the meaning of each variable to change on the training wiki.

After changing your configuration file, run:

python train.py --file [PATH_TO_CONFIG_FILE]

A sample config file can be found at config_example.ini

Evaluate your trained models

Check the meaning of each variable to change on the evaluation wiki.

After changing your configuration file, run:

python evaluate.py --file [PATH_TO_CONFIG_FILE]

A sample config file can be found at config_example.ini

Annotate new images

Check out the folder annotator in order to run our annotator to annotate new instances for the task.

Credits

Credits to OpenPifPaf for the pose detection part, and JRDB, Nuscenes and Kitti datasets for the images.

Cite our work

If you use our work for your research please cite us :)

@misc{belkada2021pedestrians,
      title={Do Pedestrians Pay Attention? Eye Contact Detection in the Wild}, 
      author={Younes Belkada and Lorenzo Bertoni and Romain Caristan and Taylor Mordan and Alexandre Alahi},
      year={2021},
      eprint={2112.04212},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

Related tags

Overview

Do pedestrians pay attention? Eye contact detection for autonomous driving

Abstract

Table of contents

Requirements

Predictor

Example command:

If you want to run the predictor on a GPU:

If you want to run the predictor on a CPU:

Create the datasets for training and evaluation

Training your models on LOOK / JAAD / PIE

Evaluate your trained models

Annotate new images

Credits

Cite our work

Owner

VITA lab at EPFL

This repository comes with the paper "On the Robustness of Counterfactual Explanations to Adverse Perturbations"

Boosted CVaR Classification (NeurIPS 2021)

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

InvTorch: memory-efficient models with invertible functions

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

A Python module for parallel optimization of expensive black-box functions

Codebase for Amodal Segmentation through Out-of-Task andOut-of-Distribution Generalization with a Bayesian Model

Simulation of Self Driving Car

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

HeatNet is a python package that provides tools to build, train and evaluate neural networks designed to predict extreme heat wave events globally on daily to subseasonal timescales.

FAVD: Featherweight Assisted Vulnerability Discovery

Invariant Causal Prediction for Block MDPs

Code accompanying the NeurIPS 2021 paper "Generating High-Quality Explanations for Navigation in Partially-Revealed Environments"

This project uses reinforcement learning on stock market and agent tries to learn trading. The goal is to check if the agent can learn to read tape. The project is dedicated to hero in life great Jesse Livermore.

Collapse by Conditioning: Training Class-conditional GANs with Limited Data

ChebLieNet, a spectral graph neural network turned equivariant by Riemannian geometry on Lie groups.

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

Implementation of "Deep Implicit Templates for 3D Shape Representation"

An official implementation of MobileStyleGAN in PyTorch

gym-anm is a framework for designing reinforcement learning (RL) environments that model Active Network Management (ANM) tasks in electricity distribution networks.