Deep Learning for Human Part Discovery in Images - Chainer implementation

Last update: Sep 25, 2022

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

NOTE: This is not official implementation. Original paper is Deep Learning for Human Part Discovery in Images.

We are now reproducing the experiments in the original paper. Any contribution will be welcomed!

Requirements

Python 2.7.11+
- Chainer 1.10+
- numpy 1.9+
- scipy 0.16+
- six
- matplotlib
- tqdm
- cv2 (opencv)

Preparation

Data

bash prepare.sh

This script downloads VOC 2010 dataset (http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar) and the authors' original dataset (http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz).

Model

You can download pre-trained FCN model from here.

We will use weights of this model and train new model on VOC dataset.

Start training

python train.py -g 0 -b 3 -e 3000 -l on -s on

Possible options

python train.py --help

GPU memory requirement

Citation from the original paper:

Each minibatch consists of just one image. The learning rate and momentum are fixed to 1e 10 and 0.99, respectively. We train the refinement layer by layer, which takes two days per refinement layer. Thus, the overall training starting from the pre-trained VGG network took 10 days on a single GPU.

Current maximum batchsize is 3 for 12 GB memory GPU.

Also it was confirmed that MBP (Late 2016, memory 16 GiB) can run with batchsize 1.

Result

Now in prep.

Visualize Prediction

python visualize.py -f PATH_TO_IMAGE_FILE

LICENSE

MIT LICENSE.

Author

shiba24, August 2016.

Contributors

bobye

Deep Learning for Human Part Discovery in Images - Chainer implementation

Related tags

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

Requirements

Preparation

Data

Model

Start training

Possible options

GPU memory requirement

Result

Visualize Prediction

LICENSE

Author

Contributors

Owner

Shintaro Shiba

Using modified BiSeNet for face parsing in PyTorch

StackRec: Efficient Training of Very Deep Sequential Recommender Models by Iterative Stacking

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

Definition of a business problem according to Wilson Lower Bound Score and Time Based Average Rating

This is an official implementation of the paper "Distance-aware Quantization", accepted to ICCV2021.

Unofficial implementation of Google "CutPaste: Self-Supervised Learning for Anomaly Detection and Localization" in PyTorch

Testability-Aware Low Power Controller Design with Evolutionary Learning, ITC2021

Implementation of the paper "Generating Symbolic Reasoning Problems with Transformer GANs"

An end-to-end PyTorch framework for image and video classification

《Geo Word Clouds》paper implementation

E2C implementation in PyTorch

Predicting a person's gender based on their weight and height

This MVP data web app uses the Streamlit framework and Facebook's Prophet forecasting package to generate a dynamic forecast from your own data.

Official implementation of "Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks", NeurIPS 2021.

Official implementation of Few-Shot and Continual Learning with Attentive Independent Mechanisms

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

Code for paper 'Hand-Object Contact Consistency Reasoning for Human Grasps Generation' at ICCV 2021

A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Campsite Reservation Finder

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)