OadTR

Code for our ICCV2021 paper: "OadTR: Online Action Detection with Transformers" ["Paper"]

Update

July 28, 2021: Our Paper "OadTR: Online Action Detection with Transformers" was accepted by ICCV2021. At the same time, we released THUMOS14-Kinetics feature.

Dependencies

pytorch==1.6.0
json
numpy
tensorboard-logger
torchvision==0.7.0

Prepare

Unzip the anno file "./data/anno_thumos.zip"
Download the feature THUMOS14-Anet feature (Note: HDD and TVSeries are available by contacting the authors of the datasets and signing agreements due to the copyrights. You can use this Repo to extract features.)

Training

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1

Validation

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1 --eval --resume models/en_3_decoder_5_lr_drop_1/checkpoint000{}.pth

Citing OadTR

Please cite our paper in your publications if it helps your research:

@article{wang2021oadtr,
  title={OadTR: Online Action Detection with Transformers},
  author={Wang, Xiang and Zhang, Shiwei and Qing, Zhiwu and Shao, Yuanjie and Zuo, Zhengrong and Gao, Changxin and Sang, Nong},
  journal={arXiv preprint arXiv:2106.11149},
  year={2021}
}

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

Related tags

Overview

OadTR

Update

Dependencies

Prepare

Training

Validation

Citing OadTR

Owner

A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

implicit displacement field

Official repository for GCR rerank, a GCN-based reranking method for both image and video re-ID

Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Forecasting with Gradient Boosted Time Series Decomposition

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Code for ICCV 2021 paper "Distilling Holistic Knowledge with Graph Neural Networks"

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments

TensorFlow tutorials and best practices.

Code for Robust Contrastive Learning against Noisy Views

MGFN: Multi-Graph Fusion Networks for Urban Region Embedding was accepted by IJCAI-2022.

🔥 Cogitare - A Modern, Fast, and Modular Deep Learning and Machine Learning framework for Python

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

Repo 4 basic seminar §How to make human machine readable"

A PyTorch Image-Classification With AlexNet And ResNet50.

All course materials for the Zero to Mastery Deep Learning with TensorFlow course.

A library for uncertainty quantification based on PyTorch

Pretrained Cost Model for Distributed Constraint Optimization Problems