The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Last update: May 28, 2022

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". The repository is modified from StudioGAN. If you find our work useful, please consider citing the following paper:

@inproceedings{chen2021ECGAN,
  title   = {A Unified View of cGANs with and without Classifiers},
  author  = {Si-An Chen and Chun-Liang Li and Hsuan-Tien Lin},
  booktitle = {Advances in Neural Information Processing Systems},
  year    = {2021}
}

Please feel free to contact Si-An Chen if you have any questions about the code/paper.

Introduction

We propose a new Conditional Generative Adversarial Network (cGAN) framework called Energy-based Conditional Generative Adversarial Network (ECGAN) which provides a unified view of cGANs and achieves state-of-the-art results. We use the decomposition of the joint probability distribution to connect the goals of cGANs and classification as a unified framework. The framework, along with a classic energy model to parameterize distributions, justifies the use of classifiers for cGANs in a principled manner. It explains several popular cGAN variants, such as ACGAN, ProjGAN, and ContraGAN, as special cases with different levels of approximations. An illustration of the framework is shown below.

Requirements

Anaconda
Python >= 3.6
6.0.0 <= Pillow <= 7.0.0
scipy == 1.1.0 (Recommended for fast loading of Inception Network)
sklearn
seaborn
h5py
tqdm
torch >= 1.6.0 (Recommended for mixed precision training and knn analysis)
torchvision >= 0.7.0
tensorboard
5.4.0 <= gcc <= 7.4.0 (Recommended for proper use of adaptive discriminator augmentation module)

You can install the recommended environment as follows:

conda env create -f environment.yml -n studiogan

With docker, you can use:

docker pull mgkang/studiogan:0.1

Quick Start

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPU 0

CUDA_VISIBLE_DEVICES=0 python3 src/main.py -t -e -c CONFIG_PATH

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPUs (0, 1, 2, 3) and DataParallel

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 src/main.py -t -e -c CONFIG_PATH

Try python3 src/main.py to see available options.

Dataset

CIFAR10: StudioGAN will automatically download the dataset once you execute main.py.
Tiny Imagenet, Imagenet, or a custom dataset:
1. download Tiny Imagenet and Imagenet. Prepare your own dataset.
2. make the folder structure of the dataset as follows:

┌── docs
├── src
└── data
    └── ILSVRC2012 or TINY_ILSVRC2012 or CUSTOM
        ├── train
        │   ├── cls0
        │   │   ├── train0.png
        │   │   ├── train1.png
        │   │   └── ...
        │   ├── cls1
        │   └── ...
        └── valid
            ├── cls0
            │   ├── valid0.png
            │   ├── valid1.png
            │   └── ...
            ├── cls1
            └── ...

Examples and Results

The src/configs directory contains config files used in our experiments.

CIFAR10 (3x32x32)

To train and evaluate ECGAN-UC on CIFAR10:

python3 src/main.py -t -e -c src/configs/CIFAR10/ecgan_v2_none_0_0p01.json

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	9.746	8.034	0.995	0.994	-	-	-
ContraGAN	StudioGAN	9.729	8.065	0.993	0.992	-	-	-
Ours	-	10.078	7.936	0.990	0.988	Cfg	Log	Link

Tiny ImageNet (3x64x64)

To train and evaluate ECGAN-UC on Tiny ImageNet:

python3 src/main.py -t -e -c src/configs/TINY_ILSVRC2012/ecgan_v2_none_0_0p01.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	11.998	31.92	0.956	0.879	-	-	-
ContraGAN	StudioGAN	13.494	27.027	0.975	0.902	-	-	-
Ours	-	18.445	18.319	0.977	0.973	Cfg	Log	Link

ImageNet (3x128x128)

To train and evaluate ECGAN-UCE on ImageNet (~12 days on 8 NVIDIA V100 GPUs):

python3 src/main.py -t -e -l -sync_bn -c src/configs/ILSVRC2012/imagenet_ecgan_v2_contra_1_0p05.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN	StudioGAN	28.633	24.684	0.941	0.921	-	-	-
ContraGAN	StudioGAN	25.249	25.161	0.947	0.855	-	-	-
Ours	-	80.685	8.491	0.984	0.985	Cfg	Log	Link

Generated Images

Here are some selected images generated by ECGAN.

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Related tags

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

Introduction

Requirements

Quick Start

Dataset

Examples and Results

CIFAR10 (3x32x32)

Tiny ImageNet (3x64x64)

ImageNet (3x128x128)

Generated Images

Owner

sianchen

PyTorch implementation of Pay Attention to MLPs

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

Answering Open-Domain Questions of Varying Reasoning Steps from Text

CrossNorm and SelfNorm for Generalization under Distribution Shifts (ICCV 2021)

Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs

Vehicle Detection Using Deep Learning and YOLO Algorithm

DTCN IJCAI - Sequential prediction learning framework and algorithm

2021 Artificial Intelligence Diabetes Datathon

SAMO: Streaming Architecture Mapping Optimisation

SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expertise

WSDM2022 "A Simple but Effective Bidirectional Extraction Framework for Relational Triple Extraction"

Official implementation of the paper Momentum Capsule Networks (MoCapsNet)

Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

YOLOV4运行在嵌入式设备上

Asymmetric metric learning for knowledge transfer

A small library of 3D related utilities used in my research.

pq is a jq-like Pickle file viewer

Contains source code for the winning solution of the xView3 challenge