Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)

Last update: Apr 16, 2022

This repository provides a PyTorch implementation of the paper Efficient Two-Step Networks for Temporal Action Segmentation.

Requirements

* Python 3.8.5
* PyTorch 1.8.1

You can install the required packages from requirements.txt:
pip install -r requirements.txt

Datasets

Download the data provided by MS-TCN, which contains the I3D features (without fine-tuning) and the ground truth labels for three datasets (about 30 GB).
Extract it so that you have the data folder in the same directory as train.py.

Directory structure

├── config
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── csv
│   ├── 50salads
│   ├── breakfast
│   └── gtea
├── dataset
│   ├── 50salads/...
│   ├── breakfast/...
│   └── gtea
│       ├── features/
│       ├── groundTruth/
│       ├── splits/
│       └── mapping.txt
├── libs
├── result
├── utils 
├── requirements.txt
├── train.py
├── eval.py
└── README.md
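
Before running the scripts, it can help to confirm that the extracted data matches this layout. The following is a minimal sketch, not part of the repository: the function name `find_missing_entries` is hypothetical, and it assumes that 50salads and breakfast contain the same four entries that the tree shows for gtea.

```python
from pathlib import Path

# Entries shown under gtea in the tree above; assumed to be the same
# for the other two datasets.
EXPECTED_ENTRIES = ("features", "groundTruth", "splits", "mapping.txt")
DATASETS = ("50salads", "breakfast", "gtea")


def find_missing_entries(dataset_root):
    """Return the expected paths that do not exist under dataset_root."""
    root = Path(dataset_root)
    missing = []
    for name in DATASETS:
        for entry in EXPECTED_ENTRIES:
            path = root / name / entry
            if not path.exists():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # Report anything missing from the default location used in this README.
    for path in find_missing_entries("./dataset"):
        print(f"missing: {path}")
```

An empty result means every expected folder and file was found.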

Training and Testing of ETSN

Setting

First, convert the ground truth files into NumPy arrays.

python utils/generate_gt_array.py ./dataset
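
To illustrate the idea behind this step, here is a rough sketch of turning frame-wise label names into integer class IDs. It is not the code in utils/generate_gt_array.py. It assumes that mapping.txt holds one `<id> <label>` pair per line and that each ground truth file holds one label per frame; the function names are hypothetical.

```python
from pathlib import Path


def load_mapping(mapping_path):
    """Parse lines of the form '<id> <label>' into a label -> id dict."""
    mapping = {}
    for line in Path(mapping_path).read_text().splitlines():
        if line.strip():
            class_id, label = line.split(maxsplit=1)
            mapping[label.strip()] = int(class_id)
    return mapping


def labels_to_ids(gt_path, mapping):
    """Map a ground truth file (one label per frame) to integer class IDs."""
    lines = Path(gt_path).read_text().splitlines()
    return [mapping[line.strip()] for line in lines if line.strip()]
```

Passing the returned list to `numpy.asarray` and then `numpy.save` would store it as an array file.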

Then run the script below to generate the CSV files for the data loader.

python utils/builda_dataset.py ./dataset
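
As a hedged illustration of what such a CSV could contain, the sketch below writes one row per video listed in a split file. It is not the repository's script. The column names `feature` and `label`, the function name `build_split_csv`, and the assumption that a split file lists one video file name per line with features stored as .npy files are all illustrative.

```python
import csv
from pathlib import Path


def build_split_csv(dataset_dir, split_file, out_csv):
    """Write one CSV row per video listed in split_file; return the row count."""
    dataset_dir = Path(dataset_dir)
    rows = []
    for line in Path(split_file).read_text().splitlines():
        if not line.strip():
            continue
        # Drop the extension so the same name can index both folders.
        video = Path(line.strip()).stem
        rows.append({
            "feature": str(dataset_dir / "features" / f"{video}.npy"),
            "label": str(dataset_dir / "groundTruth" / f"{video}.txt"),
        })
    with open(out_csv, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["feature", "label"])
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

A data loader can then read the CSV and open the feature and label files row by row.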

Training

To train a model, adjust the settings in a configuration file and pass it to train.py:

python train.py ./config/xxx/xxx/config.yaml

Evaluation

After training finishes, you can evaluate the result:

python eval.py ./result/xxx/xxx/config.yaml test

We also provide a trained ETSN model on Google Drive. Extract it so that you have the result folder in the same directory as train.py.

Average cross-validation results

python utils/average_cv_results.py [result_dir]
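
The averaging itself is simple. The sketch below shows the idea on in-memory values; it is not the code in utils/average_cv_results.py. The metric names `accuracy` and `edit` and the function name are assumptions, and the numbers are made up for illustration.

```python
from statistics import mean


def average_cv_results(split_results):
    """Average each metric over a list of per-split result dicts."""
    if not split_results:
        raise ValueError("no split results given")
    metrics = split_results[0].keys()
    return {m: mean(r[m] for r in split_results) for m in metrics}


if __name__ == "__main__":
    # Made-up numbers for illustration only, not results from the paper.
    results = [
        {"accuracy": 80.0, "edit": 70.0},
        {"accuracy": 82.0, "edit": 74.0},
    ]
    print(average_cv_results(results))
```

Each split contributes equally to the mean, which matches the usual way cross-validation scores are reported.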

Citation

If you find our code useful, please cite our paper.

@article{LI2021373,
author = {Yunheng Li and Zhuben Dong and Kaiyuan Liu and Lin Feng and Lianyu Hu and Jie Zhu and Li Xu and Yuhan Wang and Shenglan Liu},
journal = {Neurocomputing},
title = {Efficient Two-Step Networks for Temporal Action Segmentation},
year = {2021},
volume = {454},
pages = {373-381},
issn = {0925-2312},
doi = {https://doi.org/10.1016/j.neucom.2021.04.121},
url = {https://www.sciencedirect.com/science/article/pii/S0925231221006998},
}

Contact

If you have any questions, please raise an issue or contact the authors.

Acknowledgement

We thank the MS-TCN authors for the extracted I3D features, the backbone network, and the evaluation code.

We also thank Yuchi Ishikawa for sharing the PyTorch re-implementation of MS-TCN.
