Code for the ICMI 2020 and ICMI 2021 papers: "Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle" and "ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle"

Last update: Sep 04, 2022

Overview

ML-PersRef

This repository contains the Python code (in Jupyter notebooks) for the following two papers:
- ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle (ICMI 2021)
- Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle (ICMI 2020)

Conda Environment

To create the conda environment from requirements.txt:

conda create --name MLpersref --file requirements.txt

Citation

If you find this code helpful, please cite our papers:


@inproceedings{10.1145/3462244.3479910,
author = {Gomaa, Amr and Reyes, Guillermo and Feld, Michael},
title = {ML-PersRef: A Machine Learning-Based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle},
year = {2021},
isbn = {9781450384810},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3462244.3479910},
doi = {10.1145/3462244.3479910},
booktitle = {Proceedings of the 2021 International Conference on Multimodal Interaction},
pages = {318–327},
numpages = {10},
keywords = {Pointing, Deep Learning, Machine Learning, Personalized Models, Eye Gaze, Object Referencing, Multimodal Fusion},
location = {Montr\'{e}al, QC, Canada},
series = {ICMI '21}
}

@inproceedings{10.1145/3382507.3418817,
author = {Gomaa, Amr and Reyes, Guillermo and Alles, Alexandra and Rupp, Lydia and Feld, Michael},
title = {Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle},
year = {2020},
isbn = {9781450375818},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3382507.3418817},
doi = {10.1145/3382507.3418817},
booktitle = {Proceedings of the 2020 International Conference on Multimodal Interaction},
pages = {501–509},
numpages = {9},
keywords = {head pose, pointing gestures, object referencing, personalized models, eye gaze, multimodal interaction},
location = {Virtual Event, Netherlands},
series = {ICMI '20}
}

Code documentation coming soon
