Temporal Segment Networks (TSN) in PyTorch

Last update: Jan 03, 2023

Overview

TSN-Pytorch

We have released MMAction, a full-fledged action understanding toolbox based on PyTorch. It includes implementation for TSN as well as other STOA frameworks for various tasks. The lessons we learned in this repo are incorporated into MMAction to make it bettter. We highly recommend you switch to it. This repo will remain here for historical references.

Note: always use git clone --recursive https://github.com/yjxiong/tsn-pytorch to clone this project. Otherwise you will not be able to use the inception series CNN archs.

This is a reimplementation of temporal segment networks (TSN) in PyTorch. All settings are kept identical to the original caffe implementation.

For optical flow extraction and video list generation, you still need to use the original TSN codebase.

Training

To train a new model, use the main.py script.

The command to reproduce the original TSN experiments of RGB modality on UCF101 can be

python main.py ucf101 RGB <ucf101_rgb_train_list> <ucf101_rgb_val_list> \
   --arch BNInception --num_segments 3 \
   --gd 20 --lr 0.001 --lr_steps 30 60 --epochs 80 \
   -b 128 -j 8 --dropout 0.8 \
   --snapshot_pref ucf101_bninception_

For flow models:

python main.py ucf101 Flow <ucf101_flow_train_list> <ucf101_flow_val_list> \
   --arch BNInception --num_segments 3 \
   --gd 20 --lr 0.001 --lr_steps 190 300 --epochs 340 \
   -b 128 -j 8 --dropout 0.7 \
   --snapshot_pref ucf101_bninception_ --flow_pref flow_

For RGB-diff models:

python main.py ucf101 RGBDiff <ucf101_rgb_train_list> <ucf101_rgb_val_list> \
   --arch BNInception --num_segments 7 \
   --gd 40 --lr 0.001 --lr_steps 80 160 --epochs 180 \
   -b 128 -j 8 --dropout 0.8 \
   --snapshot_pref ucf101_bninception_

Testing

After training, there will checkpoints saved by pytorch, for example ucf101_bninception_rgb_checkpoint.pth.

Use the following command to test its performance in the standard TSN testing protocol:

python test_models.py ucf101 RGB <ucf101_rgb_val_list> ucf101_bninception_rgb_checkpoint.pth \
   --arch BNInception --save_scores <score_file_name>

Or for flow models:

python test_models.py ucf101 Flow <ucf101_rgb_val_list> ucf101_bninception_flow_checkpoint.pth \
   --arch BNInception --save_scores <score_file_name> --flow_pref flow_

Temporal Segment Networks (TSN) in PyTorch

Related tags

Overview

TSN-Pytorch

Training

Testing

Owner

Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

Learn about Spice.ai with in-depth samples

Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.

Easy-to-use,Modular and Extendible package of deep-learning based CTR models .

Tensorflow Repo for "DeepGCNs: Can GCNs Go as Deep as CNNs?"

RL-GAN: Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation

Materials for upcoming beginner-friendly PyTorch course (work in progress).

Machine learning for NeuroImaging in Python

This is a repository for a semantic segmentation inference API using the OpenVINO toolkit

Adversarial Framework for (non-) Parametric Image Stylisation Mosaics

A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.

Fuwa-http - The http client implementation for the fuwa eco-system

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

Pytorch implementation of "ARM: Any-Time Super-Resolution Method"

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

IPATool-py: download ipa easily

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Estimation of human density in a closed space using deep learning.

CSD: Consistency-based Semi-supervised learning for object Detection