PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

Last update: Jan 04, 2023

Overview

DiscoGAN in PyTorch

PyTorch implementation of Learning to Discover Cross-Domain Relations with Generative Adversarial Networks.

* All samples in README.md are genearted by neural network except the first image for each row.
* Network structure is slightly diffferent (here) from the author's code.

Requirements

Usage

First download datasets (from pix2pix) with:

$ bash ./data/download_dataset.sh dataset_name

facades: 400 images from CMP Facades dataset.
cityscapes: 2975 images from the Cityscapes training set.
maps: 1096 training images scraped from Google Maps
edges2shoes: 50k training images from UT Zappos50K dataset.
edges2handbags: 137K Amazon Handbag images from iGAN project.

or you can use your own dataset by placing images like:

data
├── YOUR_DATASET_NAME
│   ├── A
│   |   ├── xxx.jpg (name doesn't matter)
│   |   ├── yyy.jpg
│   |   └── ...
│   └── B
│       ├── zzz.jpg
│       ├── www.jpg
│       └── ...
└── download_dataset.sh

All images in each dataset should have same size like using imagemagick:

# for Ubuntu
$ sudo apt-get install imagemagick
$ mogrify -resize 256x256! -quality 100 -path YOUR_DATASET_NAME/A/*.jpg
$ mogrify -resize 256x256! -quality 100 -path YOUR_DATASET_NAME/B/*.jpg

# for Mac
$ brew install imagemagick
$ mogrify -resize 256x256! -quality 100 -path YOUR_DATASET_NAME/A/*.jpg
$ mogrify -resize 256x256! -quality 100 -path YOUR_DATASET_NAME/B/*.jpg

# for scale and center crop
$ mogrify -resize 256x256^ -gravity center -crop 256x256+0+0 -quality 100 -path ../A/*.jpg

To train a model:

$ python main.py --dataset=edges2shoes --num_gpu=1
$ python main.py --dataset=YOUR_DATASET_NAME --num_gpu=4

To test a model (use your load_path):

$ python main.py --dataset=edges2handbags --load_path=logs/edges2handbags_2017-03-18_10-55-37 --num_gpu=0 --is_train=False

Results

1. Toy dataset

Result of samples from 2-dimensional Gaussian mixture models. IPython notebook

# iteration: 0:

# iteration: 10000:

2. Shoes2handbags dataset

# iteration: 11200:

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (shoe -> handbag -> shoe)

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (handbag -> shoe -> handbag)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) -> G_AB(G_BA(G_AB(x_A))) -> G_BA(G_AB(G_BA(G_AB(x_A)))) -> ...

3. Edges2shoes dataset

# iteration: 9600:

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (color -> sketch -> color)

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (sketch -> color -> sketch)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) -> G_AB(G_BA(G_AB(x_A))) -> G_BA(G_AB(G_BA(G_AB(x_A)))) -> ...

4. Edges2handbags dataset

# iteration: 9500:

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (color -> sketch -> color)

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (sketch -> color -> sketch)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) -> G_AB(G_BA(G_AB(x_A))) -> G_BA(G_AB(G_BA(G_AB(x_A)))) -> ...

5. Cityscapes dataset

# iteration: 8350:

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (image -> segmentation -> image)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (segmentation -> image -> segmentation)

6. Map dataset

# iteration: 22200:

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (image -> segmentation -> image)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (segmentation -> image -> segmentation)

7. Facades dataset

Generation and reconstruction on dense segmentation dataset looks weird which are not included in the paper.
I guess a naive choice of mean square error loss for reconstruction need some change on this dataset.

# iteration: 19450:

x_B -> G_BA(x_B) -> G_AB(G_BA(x_B)) (image -> segmentation -> image)

x_A -> G_AB(x_A) -> G_BA(G_AB(x_A)) (segmentation -> image -> segmentation)

Related works

Author

Taehoon Kim / @carpedm20

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

Related tags

Overview

DiscoGAN in PyTorch

Requirements

Usage

Results

1. Toy dataset

2. Shoes2handbags dataset

3. Edges2shoes dataset

4. Edges2handbags dataset

5. Cityscapes dataset

6. Map dataset

7. Facades dataset

Related works

Author

Owner

Taehoon Kim

TANL: Structured Prediction as Translation between Augmented Natural Languages

Semi-Supervised Learning, Object Detection, ICCV2021

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

Exploration of some patients clinical variables.

Rl-quickstart - Reinforcement Learning Quickstart

Continual Learning of Long Topic Sequences in Neural Information Retrieval

Implementation of Ag-Grid component for Streamlit

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

Thermal Control of Laser Powder Bed Fusion using Deep Reinforcement Learning

GNN-based Recommendation Benchmark

An Approach to Explore Logistic Regression Models

TLoL (Python Module) - League of Legends Deep Learning AI (Research and Development)

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition

LSUN Dataset Documentation and Demo Code

Neural Magic Eye: Learning to See and Understand the Scene Behind an Autostereogram, arXiv:2012.15692.

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

A symbolic-model-guided fuzzer for TLS

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

Using LSTM to detect spoofing attacks in an Air-Ground network

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

Related tags

Overview

DiscoGAN in PyTorch

Requirements

Usage

Results

1. Toy dataset

2. Shoes2handbags dataset

3. Edges2shoes dataset

4. Edges2handbags dataset

5. Cityscapes dataset

6. Map dataset

7. Facades dataset

Related works

Author

Owner

Taehoon Kim

TANL: Structured Prediction as Translation between Augmented Natural Languages

Semi-Supervised Learning, Object Detection, ICCV2021

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

Exploration of some patients clinical variables.

Rl-quickstart - Reinforcement Learning Quickstart

Continual Learning of Long Topic Sequences in Neural Information Retrieval

Implementation of Ag-Grid component for Streamlit

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

Thermal Control of Laser Powder Bed Fusion using Deep Reinforcement Learning

GNN-based Recommendation Benchmark

An Approach to Explore Logistic Regression Models

TLoL (Python Module) - League of Legends Deep Learning AI (Research and Development)

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition

LSUN Dataset Documentation and Demo Code

Neural Magic Eye: Learning to See and Understand the Scene Behind an Autostereogram, arXiv:2012.15692.

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

A symbolic-model-guided fuzzer for TLS

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

Using LSTM to detect spoofing attacks in an Air-Ground network

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务