code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Last update: Dec 14, 2022

Overview

Video_Pace

This repository contains the code for the following paper:

Jiangliu Wang, Jianbo Jiao and Yunhui Liu, "Self-Supervised Video Representation Learning by Pace Prediction", In: ECCV (2020).

Main idea:

Framework:

Requirements

pytroch >= 1.3.0
tensorboardX
cv2
scipy

Usage

Data preparation

UCF101 dataset

Download the original UCF101 dataset from the official website. And then extarct RGB images from videos.
Or direclty download the pre-processed RGB data of UCF101 here provided by feichtenhofer.

Pre-train

Train with pace prediction task on S3D-G, the default clip length is 64 and input video size is 224 x 224.

python train.py --rgb_prefix RGB_DIR --gpu 0,1,2,3 --bs 32 --lr 0.001 --height 256 --width 256 --crop_sz 224 --clip_len 64

Train with pace prediction task on c3d/r3d/r21d, the default clip length is 16 and input video size is 112 x 112.

python train.py --rgb_prefix RGB_DIR --gpu 0 --bs 30 --lr 0.001 --model c3d/r3d/r21d --height 128 --width 171 --crop_sz 112 --clip_len 16

Evaluation

To be updated...

Citation

If you find this work useful or use our code, please consider citing:

@InProceedings{Wang20,
  author       = "Jiangliu Wang and Jianbo Jiao and Yunhui Liu",
  title        = "Self-Supervised Video Representation Learning by Pace Prediction",
  booktitle    = "European Conference on Computer Vision",
  year         = "2020",
}

Acknowlegement

Part of our codes are adapted from S3D-G HowTO100M, we thank the authors for their contributions.

code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Related tags

Overview

Video_Pace

Main idea:

Framework:

Requirements

Usage

Data preparation

Pre-train

Evaluation

Citation

Acknowlegement

Owner

Jiangliu Wang

Specification language for generating Generalized Linear Models (with or without mixed effects) from conceptual models

Pytorch implementation of Compressive Transformers, from Deepmind

Spectralformer: Rethinking hyperspectral image classification with transformers

Image Segmentation Evaluation

MlTr: Multi-label Classification with Transformer

An end-to-end machine learning web app to predict rugby scores (Pandas, SQLite, Keras, Flask, Docker)

Find the Heart simple Python Game

A novel benchmark dataset for Monocular Layout prediction

Like ThreeJS but for Python and based on wgpu

Out-of-boundary View Synthesis towards Full-frame Video Stabilization

CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

PyTorch Lightning implementation of Automatic Speech Recognition

Dynamic Realtime Animation Control

Pytorch implementation of NEGEV method. Paper: "Negative Evidence Matters in Interpretable Histology Image Classification".

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

Python interface for SmartRF Sniffer 2 Firmware

[ICCV '21] In this repository you find the code to our paper Keypoint Communities

Self Governing Neural Networks (SGNN): the Projection Layer