Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Last update: Dec 29, 2022

Related tags

Deep Learning PRP

Overview

PRP

Introduction

This is the implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Getting started

Install

Our experiments run on Python 3.6.1 and PyTorch 0.4.1. All dependencies can be installed using pip:
```
python -m pip install -r requirements.txt
```

Data preparation

We construct experiments on UCF101 and HMDB51 (the split1 of UCF101 for pre-training and the rest for fine-tuning). The expected dataset directory hierarchy is as follow:

├── UCF101/HMDB51
│   ├── split
│   │   ├── classInd.txt
│   │   ├── testlist01.txt
│   │   ├── trainlist01.txt
│   │   └── ...
│   └── video
│       ├── ApplyEyeMakeup
│       │   └── *.avi
│       └── ...
└── ...

Train and Test Pre-training on Pretext Task

python train_predict.py --gpu 0 --epoch 300 --model_name c3d/r21d/r3d

Action Recognition

python ft_classfy.py --gpu 0 --model_name c3d/r21d/r3d --pre_path [your pre-trained model] --split 1/2/3
python test_classify.py

Video Retrieval

Please refer to the code video_retrieval_samples.py of VCOP.

Model zoo

Models

Pre-trained PRP model on the split1 of UCF101: C3D(OneDrive); R3D(OneDrive); R(2+1)D(OneDrive)
Action Recognition Results

Architecture UCF101(%) HMDB51(%)

C3D 69.1 34.5

R3D 66.5 29.7

R(2+1)D 72.1 35.0

Architecture	UCF101(%)	HMDB51(%)
C3D	69.1	34.5
R3D	66.5	29.7
R(2+1)D	72.1	35.0

License

This project is released under the Apache 2.0 license.

Citation

Please cite the following paper if you feel RSPNet useful to your research

@InProceedings{Yao_2020_CVPR,  
author = {Yao, Yuan and Liu, Chang and Luo, Dezhao and Zhou, Yu and Ye, Qixiang},  
title = {Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning},  
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},  
month = {June},  
year = {2020}  
}

Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Related tags

Overview

PRP

Introduction

Getting started

Model zoo

License

Citation

Owner

yuanyao366

S-attack library. Official implementation of two papers "Are socially-aware trajectory prediction models really socially-aware?" and "Vehicle trajectory prediction works, but not everywhere".

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Training Cifar-10 Classifier Using VGG16

Do you like Quick, Draw? Well what if you could train/predict doodles drawn inside Streamlit? Also draws lines, circles and boxes over background images for annotation.

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)

Use VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.

A Factor Model for Persistence in Investment Manager Performance

SciPy fixes and extensions

Used to record WKU's utility bills on a regular basis.

This is the repository of shape matching algorithm Iterative Rotations and Assignments (IRA)

This repository gives an example on how to preprocess the data of the HECKTOR challenge

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Learning Open-World Object Proposals without Learning to Classify

Efficient 3D Backbone Network for Temporal Modeling

PaRT: Parallel Learning for Robust and Transparent AI

you can add any codes in any language by creating its respective folder (if already not available).

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

A distributed, plug-n-play algorithm for multi-robot applications with a priori non-computable objective functions

PyTorch ,ONNX and TensorRT implementation of YOLOv4