Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

Last update: Dec 26, 2022

Overview

MUST-GAN


The Pytorch implementation of our CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation".

Tianxiang Ma, Bo Peng, Wei Wang, Jing Dong

CRIPAC, NLPR, CASIA & University of Chinese Academy of Sciences.

Test results of our model under self-supervised training:

Pose transfer

Clothes style transfer

Requirements

python3
pytorch 1.1.0
numpy
scipy
scikit-image
pillow
pandas
tqdm
dominate
visdom
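
Before running any script, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the helper name missing_packages and the mapping from package names to import names (for example scikit-image to skimage, pillow to PIL) are assumptions made for illustration, and the check does not verify the pytorch 1.1.0 version pin.

```python
from importlib.util import find_spec

# Package name from the list above -> module name used at import time.
# This mapping is an assumption for illustration, not taken from the repo.
REQUIRED = {
    "pytorch": "torch",
    "numpy": "numpy",
    "scipy": "scipy",
    "scikit-image": "skimage",
    "pillow": "PIL",
    "pandas": "pandas",
    "tqdm": "tqdm",
    "dominate": "dominate",
    "visdom": "visdom",
}


def missing_packages(required=REQUIRED):
    """Return the sorted package names whose module cannot be found."""
    return sorted(
        pkg for pkg, module in required.items() if find_spec(module) is None
    )


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```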

Getting Started

Installation

Clone this repo:

git clone https://github.com/TianxiangMa/MUST-GAN.git
cd MUST-GAN

Data Preparation

We train and test our model on the DeepFashion dataset. Specifically, we use the High-Res Images in the In-shop Clothes Retrieval Benchmark.

Download this dataset and unzip it (you will need to request the password), then put the folder img_highres under the ./datasets directory. Download the train/test split lists, which are used by many methods, and put them under the ./datasets directory.

Run the following command to split the dataset into train and test sets.

python tool/generate_fashion_datasets.py

Download the source-target paired image list, the same list used by many previous works. Because our method supports self-supervised training, fashion-resize-pairs-train.csv is not needed; download train_images_lst.csv for training instead.

Download the train/test keypoint annotation files and the semantic segmentation files.

Put all the above files into the ./datasets folder.

Run the following commands to generate the pose maps and pose connection maps.
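
A quick way to confirm the downloads landed in the right place is to check for the expected entries before running the tools. This is a minimal sketch, not part of the repository: the helper find_missing is hypothetical, and the expected list contains only the two names this README states explicitly (img_highres and train_images_lst.csv). Add the split, keypoint, and segmentation file names you actually downloaded.

```python
from pathlib import Path


def find_missing(root, expected):
    """Return the entries of `expected` that do not exist under `root`."""
    root = Path(root)
    return [name for name in expected if not (root / name).exists()]


if __name__ == "__main__":
    # Only names mentioned in this README; extend with your other downloads.
    expected = ["img_highres", "train_images_lst.csv"]
    missing = find_missing("./datasets", expected)
    if missing:
        print("Missing under ./datasets:", ", ".join(missing))
    else:
        print("All expected entries found under ./datasets.")
```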

python tool/generate_pose_map.py
python tool/generate_pose_connection_map.py
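
For orientation, pose-guided generation methods commonly encode each body keypoint as one Gaussian heatmap channel. The sketch below illustrates that general idea only. It is not the code in tool/generate_pose_map.py, and the function name, the (y, x) coordinate order, the sigma value, and the use of -1 to mark undetected keypoints are all assumptions.

```python
import numpy as np


def keypoints_to_heatmaps(keypoints, height, width, sigma=6.0, missing=-1):
    """Build one Gaussian heatmap per keypoint.

    keypoints: sequence of (y, x) pixel coordinates, one row per keypoint.
    A coordinate equal to `missing` marks an undetected keypoint, whose
    channel is left as all zeros. These conventions are assumptions.
    """
    keypoints = np.asarray(keypoints, dtype=np.float32)
    heatmaps = np.zeros((len(keypoints), height, width), dtype=np.float32)
    # Pixel coordinate grids, each of shape (height, width).
    ys, xs = np.mgrid[0:height, 0:width]
    for k, (y, x) in enumerate(keypoints):
        if y == missing or x == missing:
            continue
        squared_dist = (ys - y) ** 2 + (xs - x) ** 2
        heatmaps[k] = np.exp(-squared_dist / (2.0 * sigma ** 2))
    return heatmaps


if __name__ == "__main__":
    maps = keypoints_to_heatmaps([[10, 20], [-1, -1]], height=32, width=48)
    print(maps.shape)
```

Each channel peaks at its keypoint location and decays with distance, which gives the generator a spatially smooth pose signal instead of a single active pixel.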

Download the pretrained VGG model for training, and put it into the ./datasets folder.

Test

Download our pretrained model and put it into the ./check_points/MUST-GAN/ folder.

Run the following command, setting the parameters as needed.

bash scripts/test.sh

Train

Run the following command, setting the parameters as needed.

bash scripts/train.sh

Citation

If you use this code for your research, please cite our paper:

@InProceedings{Ma_2021_CVPR,
    author    = {Ma, Tianxiang and Peng, Bo and Wang, Wei and Dong, Jing},
    title     = {MUST-GAN: Multi-Level Statistics Transfer for Self-Driven Person Image Generation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13622-13631}
}

Acknowledgments

Our code is based on PATN and ADGAN; we thank the authors for their great work.
