The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Last update: May 28, 2022

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". The repository is modified from StudioGAN. If you find our work useful, please consider citing the following paper:

@inproceedings{chen2021ECGAN,
  title   = {A Unified View of cGANs with and without Classifiers},
  author  = {Si-An Chen and Chun-Liang Li and Hsuan-Tien Lin},
  booktitle = {Advances in Neural Information Processing Systems},
  year    = {2021}
}

Please feel free to contact Si-An Chen if you have any questions about the code/paper.

Introduction

We propose a new Conditional Generative Adversarial Network (cGAN) framework, the Energy-based Conditional Generative Adversarial Network (ECGAN), which provides a unified view of cGANs and achieves state-of-the-art results. We use the decomposition of the joint probability distribution to connect the goals of cGANs and classification in a single framework. Combined with a classic energy model that parameterizes the distributions, the framework justifies the use of classifiers for cGANs in a principled manner. It also explains several popular cGAN variants, such as ACGAN, ProjGAN, and ContraGAN, as special cases with different levels of approximation. An illustration of the framework is shown below.
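
As a rough sketch of the decomposition mentioned above, the joint log-probability can be split in two ways, and an energy-based model is one way to parameterize it. The notation here is an assumption made for illustration and is not copied from the paper: f_θ(x)[y] stands for a network output for input x and class y, and Z_θ for a normalizing constant.

```latex
\begin{aligned}
\log p(x, y) &= \log p(x \mid y) + \log p(y) && \text{(conditional generation view)} \\
             &= \log p(y \mid x) + \log p(x) && \text{(classification view)} \\
p_\theta(x, y) &= \frac{\exp\big(f_\theta(x)[y]\big)}{Z_\theta},
\qquad Z_\theta = \sum_{y'} \int \exp\big(f_\theta(x')[y']\big)\, dx'
\end{aligned}
```

The first line contains the term p(x | y) that a cGAN targets; the second contains the classifier term p(y | x), which is what links the two goals.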

Requirements

Anaconda
Python >= 3.6
6.0.0 <= Pillow <= 7.0.0
scipy == 1.1.0 (Recommended for fast loading of Inception Network)
sklearn
seaborn
h5py
tqdm
torch >= 1.6.0 (Recommended for mixed precision training and knn analysis)
torchvision >= 0.7.0
tensorboard
5.4.0 <= gcc <= 7.4.0 (Recommended for proper use of adaptive discriminator augmentation module)
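
The version ranges above can be checked with a small helper before training. The following is a minimal sketch, not part of the repository: the function names `parse_version` and `in_range` are hypothetical, and the parsing only looks at the leading numeric part of a version string.

```python
import re


def parse_version(text):
    """Turn a string such as '1.6.0+cu101' into (1, 6, 0).

    Only the leading dotted-number part is used; suffixes are ignored.
    Versions with fewer than three components are padded with zeros.
    """
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("no numeric version found in %r" % text)
    parts = [int(p) for p in match.group(0).split(".")]
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def in_range(version, low=None, high=None):
    """Return True if low <= version <= high (bounds are optional)."""
    v = parse_version(version)
    if low is not None and v < parse_version(low):
        return False
    if high is not None and v > parse_version(high):
        return False
    return True


# Example use (requires the packages to be installed):
#   import PIL, torch
#   print(in_range(PIL.__version__, "6.0.0", "7.0.0"))
#   print(in_range(torch.__version__, low="1.6.0"))
```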

You can install the recommended environment as follows:

conda env create -f environment.yml -n studiogan

With Docker, you can use:

docker pull mgkang/studiogan:0.1

Quick Start

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPU 0:

CUDA_VISIBLE_DEVICES=0 python3 src/main.py -t -e -c CONFIG_PATH

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPUs (0, 1, 2, 3) and DataParallel:

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 src/main.py -t -e -c CONFIG_PATH

Try python3 src/main.py to see available options.
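
When running several configs in a row, the commands above can be launched from a small Python wrapper. This is only a sketch: the helper names `build_command`, `build_env`, and `run` are hypothetical and not part of the repository. It uses only the `-t`, `-e`, and `-c` flags and the `CUDA_VISIBLE_DEVICES` variable shown above, and assumes it is run from the repository root.

```python
import os
import subprocess


def build_command(config_path, train=True, evaluate=True):
    """Build the argument list for src/main.py."""
    cmd = ["python3", "src/main.py"]
    if train:
        cmd.append("-t")
    if evaluate:
        cmd.append("-e")
    cmd += ["-c", config_path]
    return cmd


def build_env(gpu_ids):
    """Copy the current environment and restrict the visible GPUs."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(g) for g in gpu_ids)
    return env


def run(config_path, gpu_ids):
    """Launch one training/evaluation run; raises CalledProcessError on failure."""
    subprocess.run(build_command(config_path), env=build_env(gpu_ids), check=True)
```

For example, `run("CONFIG_PATH", [0, 1, 2, 3])` corresponds to the four-GPU command above.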

Dataset

CIFAR10: StudioGAN will automatically download the dataset once you execute main.py.
Tiny ImageNet, ImageNet, or a custom dataset:
1. Download Tiny ImageNet or ImageNet, or prepare your own dataset.
2. Arrange the dataset folders as follows:

┌── docs
├── src
└── data
    └── ILSVRC2012 or TINY_ILSVRC2012 or CUSTOM
        ├── train
        │   ├── cls0
        │   │   ├── train0.png
        │   │   ├── train1.png
        │   │   └── ...
        │   ├── cls1
        │   └── ...
        └── valid
            ├── cls0
            │   ├── valid0.png
            │   ├── valid1.png
            │   └── ...
            ├── cls1
            └── ...
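
Before starting a long run, it can help to confirm that a dataset folder matches the layout above. The following is a minimal sketch that uses only the standard library. The function name `check_layout` and the set of accepted image suffixes are assumptions, not part of the repository.

```python
from pathlib import Path

# Assumption: adjust this set to the image formats in your dataset.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def check_layout(dataset_dir):
    """Return a list of problems found under dataset_dir (empty if none).

    Expects dataset_dir/train/<class>/<images> and
    dataset_dir/valid/<class>/<images>.
    """
    root = Path(dataset_dir)
    problems = []
    for split in ("train", "valid"):
        split_dir = root / split
        if not split_dir.is_dir():
            problems.append("missing split folder: %s" % split_dir)
            continue
        class_dirs = sorted(p for p in split_dir.iterdir() if p.is_dir())
        if not class_dirs:
            problems.append("no class folders in: %s" % split_dir)
        for class_dir in class_dirs:
            has_image = any(
                f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
                for f in class_dir.iterdir()
            )
            if not has_image:
                problems.append("no images in: %s" % class_dir)
    return problems
```

For example, `check_layout("data/CUSTOM")` returns an empty list when both splits exist and every class folder contains at least one image.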

Examples and Results

The src/configs directory contains config files used in our experiments.

CIFAR10 (3x32x32)

To train and evaluate ECGAN-UC on CIFAR10:

python3 src/main.py -t -e -c src/configs/CIFAR10/ecgan_v2_none_0_0p01.json

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	9.746	8.034	0.995	0.994	-	-	-
ContraGAN	StudioGAN	9.729	8.065	0.993	0.992	-	-	-
Ours	-	10.078	7.936	0.990	0.988	Cfg	Log	Link

Tiny ImageNet (3x64x64)

To train and evaluate ECGAN-UC on Tiny ImageNet:

python3 src/main.py -t -e -c src/configs/TINY_ILSVRC2012/ecgan_v2_none_0_0p01.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	11.998	31.92	0.956	0.879	-	-	-
ContraGAN	StudioGAN	13.494	27.027	0.975	0.902	-	-	-
Ours	-	18.445	18.319	0.977	0.973	Cfg	Log	Link

ImageNet (3x128x128)

To train and evaluate ECGAN-UCE on ImageNet (~12 days on 8 NVIDIA V100 GPUs):

python3 src/main.py -t -e -l -sync_bn -c src/configs/ILSVRC2012/imagenet_ecgan_v2_contra_1_0p05.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN	StudioGAN	28.633	24.684	0.941	0.921	-	-	-
ContraGAN	StudioGAN	25.249	25.161	0.947	0.855	-	-	-
Ours	-	80.685	8.491	0.984	0.985	Cfg	Log	Link

Generated Images

Here are some selected images generated by ECGAN.
