Source code release of the paper: Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.

Last update: Nov 21, 2022

Related tags

Deep Learning GNet-pose

Overview

GNet-pose

Project Page: http://guanghan.info/projects/guided-fractal/

UPDATE 9/27/2018:

Prototxts and model that achieved 93.9Pck on LSP dataset. http://guanghan.info/download/Data/GNet_update.zip

When I was replying e-mails, it occurred to me that the models that I had uploaded was around May/June 2017 (performance in old arxiv version), and in August 2017 the performance was improved to 93.9 on LSP with a newer caffe version which fixed the downsampling and/or upsampling deprecation problem (Yeah, it "magically" improved the performance). The best model was 94.0071 on LSP dataset, but it was not uploaded nor published on the benchmark.

Overview

Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.

Source code release of the paper for reproduction of experimental results, and to aid researchers in future research.

Prerequisites

Python 2.7 or Python 3.3+
Modified Caffe

Getting Started

1. Download Data and Pre-trained Models

Datasets (MPII [1], LSP [2])
```
bash ./get_dataset.sh
```
Models
```
bash ./get_models.sh
```
Predictions (optional)
```
bash ./get_preds.sh
```

2. Testing

Generate cropped patches from the dataset for testing:
```
cd testing/
matlab gen_cropped_LSP_test_images.m
matlab gen_cropped_MPII_test_images.m
cd -
```
This will generate images with 368-by-368 resolution.
Reproduce the results with the pre-trained model:
```
cd testing/
python .test.py
cd -
```
You can choose different dataset to test on, with different models. You can also choose different settings in test.py, e.g., with or without flipping, scaling, cross-heatmap regression, etc.

3. Training

Generate Annotations
```
cd training/Annotations/
matlab MPI.m LEEDS.m
cd -
```
This will generate annotations in json files.
Generate LMDB
```
python ./training/Data/genLMDB.py
```
This will load images from dataset and annotations from json files, and generate lmdb files for caffe training.

Generate Prototxt files (optional)

python ./training/GNet/scripts/gen_GNet.py
python ./training/GNet/scripts/gen_fractal.py
python ./training/GNet/scripts/gen_hourglass.py

Training:
```
 bash ./training/train.sh
```

4. Performance Evaluation

cd testing/eval_LSP/; matlab test_evaluation_lsp.m; cd../

cd testing/eval_MPII/; matlab test_evaluation_mpii_test.m

5. Results

More Qualitative results can be found in the project page. Quantitative results please refer to the arxiv paper.

License

GNet-pose is released under the Apache License Version 2.0 (refer to the LICENSE file for details).

Citation

If you use the code and models, please cite the following paper: TMM 2017.

@article{ning2017knowledge, 
 author={G. Ning and Z. Zhang and Z. He}, 
     journal={IEEE Transactions on Multimedia}, 
     title={Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation}, 
     year={2017}, 
     doi={10.1109/TMM.2017.2762010}, 
     ISSN={1520-9210}, }

Reference

[1] Andriluka M, Pishchulin L, Gehler P, et al. "2d human pose estimation: New benchmark and state of the art analysis." CVPR (2014).

[2] Sam Johnson and Mark Everingham. "Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation." BMVC (2010).

Source code release of the paper: Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.

Related tags

Overview

GNet-pose

Overview

Prerequisites

Getting Started

1. Download Data and Pre-trained Models

2. Testing

3. Training

4. Performance Evaluation

5. Results

License

Citation

Reference

Owner

Guanghan Ning

Neural Radiance Fields Using PyTorch

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Development kit for MIT Scene Parsing Benchmark

Utility code for use with PyXLL

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Starter kit for getting started in the Music Demixing Challenge.

git《Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser》(2021) GitHub: [fig5]

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Dataset Condensation with Contrastive Signals

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

This repository contains implementations of all Machine Learning Algorithms from scratch in Python. Mathematics required for ML and many projects have also been included.

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation

Process text, including tokenizing and representing sentences as vectors and Applying some concepts like RNN, LSTM and GRU to create a classifier can detect the language in which a sentence is written from among 17 languages.