Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Last update: Dec 07, 2022

Related tags

Deep Learning SELFY

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

This is the official implementation of the paper "Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition" by H.Kwon, M.Kim, S.Kwak, and M.Cho. For more information, checkout the project website and the paper on arXiv.

Environment:

Cuda: 9.0
gcc: 7.3.0
Python 3.6.8
PyTorch 1.0.1
TorchVison: 0.2.2
Spatial Correlation Sampler
Others: environment.yml

Anaconda environment setting

git clone https://github.com/arunos728/SELFY.git
cd selfy
conda env create -f environment.yml
conda activate selfy

Installing Correlation sampler

cd Pytorch-Correlation-extension
python setup.py install

# check whether SpatialCorrelationSampler is installed correctly.
python check.py forward
python check.py backward
python checkCorrelationSampler.py

Please check this repo for the detailed instructions.

Dataset preparation

Please refer to TSM repo for the detailed data preparation instructions.

File lists (.txt files in ./data) specify configurations of each video clips (path, #frames, class). We upload our Something-Something-V1 & V2 video file lists in ./data. The path of the file lists should be added into the scripts for training (or testing).

Training & Testing

For training SELFYNet on Something-Something, use the following command:

    ./scripts/train_SELFY_Something.sh

For testing your trained model on Something-Something, use the following command:

    ./scripts/test_SELFY_Something.sh

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@inproceedings{kwon2021learning,
  title={Learning self-similarity in space and time as generalized motion for video action recognition},
  author={Kwon, Heeseung and Kim, Manjin and Kwak, Suha and Cho, Minsu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={13065--13075},
  year={2021}
}

Contact

Heeseung Kwon([email protected]), Manjin Kim([email protected])

Questions can also be left as issues in the repository. We will be happy to answer them.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Related tags

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

Environment:

Anaconda environment setting

Installing Correlation sampler

Dataset preparation

Training & Testing

Citation

Contact

Owner

TAPEX: Table Pre-training via Learning a Neural SQL Executor

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

Course on computational design, non-linear optimization, and dynamics of soft systems at UIUC.

MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition

Accompanying code for the paper "A Kernel Test for Causal Association via Noise Contrastive Backdoor Adjustment".

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

Fully-automated scripts for collecting AI-related papers

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

[NeurIPS '21] Adversarial Attacks on Graph Classification via Bayesian Optimisation (GRABNEL)

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Pixray is an image generation system

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

The source code and data of the paper "Instance-wise Graph-based Framework for Multivariate Time Series Forecasting".

NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021

A C implementation for creating 2D voronoi diagrams

GPU Programming with Julia - course at the Swiss National Supercomputing Centre (CSCS), ETH Zurich

I-SECRET: Importance-guided fundus image enhancement via semi-supervised contrastive constraining

POCO: Point Convolution for Surface Reconstruction

Tree LSTM implementation in PyTorch