This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Last update: Jan 07, 2023

Related tags

Deep Learning StridedTransformer-Pose3D

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

This repo is the official implementation of Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation in Pytorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_gt.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M with 351-frames:

python main.py --frames 351 --refine --reload 1  --refine_reload 1 --previous_dir 'checkpoint/351'

Train the model

To train on Human3.6M with 351-frame:

python main.py --frames 351 --train 1 \

After training for several epoches, add refine module

python main.py --frames 351 --train 1 --refine --lr 1e-5 --reload 1 --previous_dir [your model saved path] \

Citation

If you find our work useful in your research, please consider citing:

@article{li2021exploiting,
  title={Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Ding, Runwei and Liu, Mengyuan and Wang, Pichao and Yang, Wenming},
  journal={arXiv preprint arXiv:2103.14304},
  year={2021}
}

Acknowledgement

Our code is built on top of ST-GCN and is extended from the following repositories. We thank the authors for releasing the codes.

This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Related tags

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights

The Official PyTorch Implementation of DiscoBox.

Lane follower: Lane-detector (OpenCV) + Object-detector (YOLO5) + CAN-bus

This repository contains a CBIR system that uses swin transformer to extract image's feature.

Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Facial Expression Detection In The Realtime

[NeurIPS 2021] Garment4D: Garment Reconstruction from Point Cloud Sequences

Code for the AAAI-2022 paper: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting

Learning Off-Policy with Online Planning, CoRL 2021

On Out-of-distribution Detection with Energy-based Models

This implements the learning and inference/proposal algorithm described in "Learning to Propose Objects, Krähenbühl and Koltun"

Cross-modal Deep Face Normals with Deactivable Skip Connections

Robust & Reliable Route Recommendation on Road Networks

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

This is the source code for generating the ASL-Skeleton3D and ASL-Phono datasets. Check out the README.md for more details.

NeuroFind - A solution to the to the Task given by the Oberseminar of Messtechnik Institute of TU Dresden in 2021

Offical implementation of Shunted Self-Attention via Multi-Scale Token Aggregation