OadTR

Code for our ICCV2021 paper: "OadTR: Online Action Detection with Transformers" ["Paper"]

Update

July 28, 2021: Our Paper "OadTR: Online Action Detection with Transformers" was accepted by ICCV2021. At the same time, we released THUMOS14-Kinetics feature.

Dependencies

pytorch==1.6.0
json
numpy
tensorboard-logger
torchvision==0.7.0

Prepare

Unzip the anno file "./data/anno_thumos.zip"
Download the feature THUMOS14-Anet feature (Note: HDD and TVSeries are available by contacting the authors of the datasets and signing agreements due to the copyrights. You can use this Repo to extract features.)

Training

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1

Validation

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1 --eval --resume models/en_3_decoder_5_lr_drop_1/checkpoint000{}.pth

Citing OadTR

Please cite our paper in your publications if it helps your research:

@article{wang2021oadtr,
  title={OadTR: Online Action Detection with Transformers},
  author={Wang, Xiang and Zhang, Shiwei and Qing, Zhiwu and Shao, Yuanjie and Zuo, Zhengrong and Gao, Changxin and Sang, Nong},
  journal={arXiv preprint arXiv:2106.11149},
  year={2021}
}

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

Related tags

Overview

OadTR

Update

Dependencies

Prepare

Training

Validation

Citing OadTR

Owner

百度2021年语言与智能技术竞赛机器阅读理解Pytorch版baseline

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

Hand Gesture Volume Control | Open CV | Computer Vision

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

pyhsmm - library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.

Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom

🔥 TensorFlow Code for technical report: "YOLOv3: An Incremental Improvement"

SimpleDepthEstimation - An unified codebase for NN-based monocular depth estimation methods

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

TilinGNN: Learning to Tile with Self-Supervised Graph Neural Network (SIGGRAPH 2020)

Pytorch Implementation of "Diagonal Attention and Style-based GAN for Content-Style disentanglement in image generation and translation" (ICCV 2021)

To prepare an image processing model to classify the type of disaster based on the image dataset

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Continual Learning of Electronic Health Records (EHR).

3.8% and 18.3% on CIFAR-10 and CIFAR-100

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Active Offline Policy Selection With Python

商品推荐系统