Sign Language Transformers (CVPR'20)

This repo contains the training and evaluation code for the paper Sign Language Transformers: Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation.

This code is based on Joey NMT but modified to realize joint continuous sign language recognition and translation. For text-to-text translation experiments, you can use the original Joey NMT framework.

Requirements

Download the feature files using the data/download.sh script.
[Optional] Create a conda or python virtual environment.
Install required packages using the requirements.txt file.

pip install -r requirements.txt

Usage

python -m signjoey train configs/sign.yaml

! Note that the default data directory is ./data. If you download them to somewhere else, you need to update the data_path parameters in your config file.

ToDo:

Initial code release.
Release image features for Phoenix2014T.
Share extensive qualitative and quantitative results & config files to generate them.
(Nice to have) - Guide to set up conda environment and docker image.

Reference

Please cite the paper below if you use this code in your research:

@inproceedings{camgoz2020sign,
  author = {Necati Cihan Camgoz and Oscar Koller and Simon Hadfield and Richard Bowden},
  title = {Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

Acknowledgements

_{This work was funded by the SNSF Sinergia project "Scalable Multimodal Sign Language Technology for Sign Language Learning and Assessment" (SMILE) grant agreement number CRSII2 160811 and the European Union’s Horizon2020 research and innovation programme under grant agreement no. 762021 (Content4All). This work reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains. We would also like to thank NVIDIA Corporation for their GPU grant.}

Sign Language Transformers (CVPR'20)

Related tags

Overview

Sign Language Transformers (CVPR'20)

Requirements

Usage

ToDo:

Reference

Acknowledgements

Owner

Necati Cihan Camgoz

A full-fledged version of Pix2Seq

[CVPR'2020] DeepDeform: Learning Non-rigid RGB-D Reconstruction with Semi-supervised Data

Python 3 module to print out long strings of text with intervals of time inbetween

Implementation of CVAE. Trained CVAE on faces from UTKFace Dataset to produce synthetic faces with a given degree of happiness/smileyness.

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Paddle-Adversarial-Toolbox (PAT) is a Python library for Deep Learning Security based on PaddlePaddle.

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"

Xview3 solution - XView3 challenge, 2nd place solution

Official Pytorch implementation of "Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral)"

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models Benchmark and Efficient Evaluation

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

SSL_SLAM2: Lightweight 3-D Localization and Mapping for Solid-State LiDAR (mapping and localization separated) ICRA 2021

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

Inferred Model-based Fuzzer

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Official implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM