Test-Time Personalization with a Transformer for Human Pose Estimation, NeurIPS 2021

Last update: Nov 28, 2022

Transforming Self-Supervision in Test Time for Personalizing Human Pose Estimation

This is an official implementation of the NeurIPS 2021 paper: Transforming Self-Supervision in Test Time for Personalizing Human Pose Estimation. More details can be found at our project website.

Preparation

Install dependencies

```
pip install -r requirements.txt
```

Make libs
```
cd ${PROJECT_ROOT}/lib
make
```

Place the Penn Action data in the data directory. (Instructions for Human3.6M and BBC Pose are coming soon.)

Your directory tree should look like this:

```
${PROJECT_ROOT}
└── data
    └── Penn_Action
        ├── frames
        ├── labels
        ├── tools
        └── README
```
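
Before training, a quick check of the layout can help catch a misplaced dataset. The following is a minimal sketch, not part of the repository: the helper name `missing_penn_action_entries` is hypothetical, and it only assumes the four entries listed in the tree above.

```python
from pathlib import Path

# Entries expected under data/Penn_Action, taken from the tree above.
EXPECTED_ENTRIES = ("frames", "labels", "tools", "README")


def missing_penn_action_entries(project_root):
    """Return the expected Penn_Action entries that are absent.

    Hypothetical helper for illustration; it is not shipped with the project.
    """
    base = Path(project_root) / "data" / "Penn_Action"
    return [name for name in EXPECTED_ENTRIES if not (base / name).exists()]


if __name__ == "__main__":
    # Run from ${PROJECT_ROOT}; an empty list means the layout matches the tree.
    missing = missing_penn_action_entries(".")
    print("missing entries:", missing if missing else "none")
```

An empty result only confirms that the top-level entries exist; it does not validate the contents of `frames` or `labels`.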

Download the pretrained ResNet-18 and ResNet-50 models and place them in models/pytorch/imagenet.

Your directory tree should look like this:

```
${PROJECT_ROOT}
└── models
    └── pytorch
        └── imagenet
            ├── resnet18-5c106cde.pth
            └── resnet50-19c8e357.pth
```
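
A truncated download is easy to miss until the weights fail to load. The sketch below assumes the checkpoint names follow the torch.hub convention, in which the suffix after the last hyphen is the leading hex digits of the file's SHA-256 hash. The helper `sha256_prefix_matches` is hypothetical and not part of this repository.

```python
import hashlib
from pathlib import Path


def sha256_prefix_matches(path):
    """Check that the hex suffix in '<name>-<prefix>.pth' matches the file's SHA-256.

    Assumption: the file name follows the torch.hub '<name>-<sha256 prefix>.<ext>'
    convention. Hypothetical helper for illustration only.
    """
    path = Path(path)
    expected_prefix = path.stem.rsplit("-", 1)[-1]
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # Read in 1 MiB chunks so large checkpoints are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest().startswith(expected_prefix)
```

For example, calling `sha256_prefix_matches("models/pytorch/imagenet/resnet18-5c106cde.pth")` from the project root should return `True` for an intact file, provided the naming assumption holds.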

Training and Test-time Personalization

Training

```
python tools/train_joint.py \
   --cfg experiments/penn/joint_res50_128x128_1e-3_comb_attn_tf1_4head.yaml
```

Run Test-Time Personalization (online)

```
python tools/test_time_training.py \
   --cfg experiments/penn/ttp_res50_128x128_lr1e-4_online_downsample1_comb_attn_tf1_4head.yaml \
   TEST.MODEL_FILE ${MODEL_FILE}
```

Run Test-Time Personalization (offline)

```
python tools/test_time_training.py \
   --cfg experiments/penn/ttp_res50_128x128_lr1e-4_offline_downsample1_comb_attn_tf1_4head.yaml \
   TEST.MODEL_FILE ${MODEL_FILE}
```
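
The online and offline commands differ only in the config file name, so they can be generated from one template when scripting both runs. This is a minimal sketch under that observation: `ttp_command` and `CONFIG_TEMPLATE` are hypothetical names, not part of the repository, and the checkpoint path in the usage comment is a placeholder.

```python
import sys

# Config path pattern taken from the two commands above; only the mode differs.
CONFIG_TEMPLATE = (
    "experiments/penn/"
    "ttp_res50_128x128_lr1e-4_{mode}_downsample1_comb_attn_tf1_4head.yaml"
)


def ttp_command(mode, model_file):
    """Build the argument list for one Test-Time Personalization run.

    Hypothetical helper: it only assembles the command shown in this README.
    """
    if mode not in ("online", "offline"):
        raise ValueError(f"mode must be 'online' or 'offline', got {mode!r}")
    return [
        sys.executable,
        "tools/test_time_training.py",
        "--cfg",
        CONFIG_TEMPLATE.format(mode=mode),
        "TEST.MODEL_FILE",
        str(model_file),
    ]


# Usage (run from ${PROJECT_ROOT}; the checkpoint path is a placeholder):
#   import subprocess
#   subprocess.run(ttp_command("online", "path/to/checkpoint.pth"), check=True)
```

Returning a list rather than a shell string avoids quoting problems when the checkpoint path contains spaces.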

Baseline Model

To train the baseline model for comparison:

```
python tools/train.py --cfg experiments/penn/res50_128x128.yaml
```

Result

Configs, results and model checkpoints on Human3.6M and BBC Pose are coming soon.

| Method | TTP Scenario | Penn Action | Checkpoint |
|---|---|---|---|
| Baseline | - | 85.233 | Google Drive |
| Ours | before TTP | 86.283 | Google Drive |
| Ours | online | 87.660 | - |
| Ours | offline | 88.633 | - |
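
To read the table as gains over the baseline, the differences can be computed directly. The snippet below is a small illustration using only the Penn Action scores listed above; the function name `gains_over_baseline` is hypothetical.

```python
# Scores copied from the Penn Action column of the table above.
SCORES = {
    "Baseline": 85.233,
    "Ours (before TTP)": 86.283,
    "Ours (online)": 87.660,
    "Ours (offline)": 88.633,
}


def gains_over_baseline(scores, baseline_key="Baseline"):
    """Return each method's score minus the baseline score, rounded to 3 decimals."""
    base = scores[baseline_key]
    return {
        name: round(value - base, 3)
        for name, value in scores.items()
        if name != baseline_key
    }


for name, gain in gains_over_baseline(SCORES).items():
    print(f"{name}: +{gain:.3f}")
# Ours (before TTP): +1.050
# Ours (online): +2.427
# Ours (offline): +3.400
```

On these numbers, offline personalization adds 2.350 points on top of the jointly trained model before TTP (88.633 vs. 86.283).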

Acknowledgement

TTP is developed based on HRNet. We also incorporate some code from IMM.
