DeLiGAN - This project is an implementation of the Generative Adversarial Network

Last update: Sep 13, 2022

Overview

DeLiGAN

This project is an implementation of the Generative Adversarial Network proposed in our CVPR 2017 paper - DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data. Via this project, we make two contributions:

We propose a simple but effective modification of the GAN framework for settings where training data is diverse yet small in size.
We propose a modification of inception-score proposed by Salimans et al. Our modified inception-score provides a single, unified measure of inter-class and intra-class variety in samples generated by a GAN.

Dependencies

The code for DeLiGAN is provided in Tensorflow 0.10 for the MNIST and Toy dataset, and in Theano 0.8.2 + Lasagne 0.2 for the CIFAR-10 and Sketches dataset. This code was tested on a Ubuntu 14.04 workstation hosting a NVIDIA Titan X GPU.

Datasets

This repository includes implementations for 4 different datasets.

Toy (self generated unimodal and bimodal gaussians)
MNIST (http://www.cs.toronto.edu/~gdahl/mnist.npz.gz)
CIFAR-10 (https://www.cs.toronto.edu/~kriz/cifar.html)
Sketches (http://cybertron.cg.tu-berlin.de/eitz/projects/classifysketch/)

The models for evaluating DeLiGAN on these datasets can be found in our repo. The details for how to download and lay out the datasets can be found in src/datasets/README.md

Usage

Training DeLiGAN models

To run any of the models

First download the datasets and store them in the respective sub-folder of the datasets folder (src/datasets/)
To run the model on any of the datasets, go to the respective src folders and run the dg_'dataset'.py file in the respective dataset folders with two arguments namely, --data_dir and --results_dir. For example, starting from the top-level folder,

cd src/sketches 
python dg_sketches.py --data_dir ../datasets/sketches/ --results_dir ../results/sketches

Note that the results_dir needs to have 'train' as a sub-folder.

Modified inception score

For example, to obtain the modified inception scores on CIFAR

Download the inception-v3 model (http://download.tensorflow.org/models/image/imagenet/inception-2015-12-05.tgz.) and store it in src/modified_inception_scores/cifar10/
Generate samples using the model trained in the dg_cifar.py and copy it to src/modified_inception_scores/cifar10/
Run transfer_cifar10_softmax_b1.py to transfer learn the last layer.
Perform the modifications detailed in the comments in transfer_cifar10_softmax_b1.py and re-run it to evaluate the inception scores.
The provided code can be modified slightly to work for sketches as well by following the comments provided in transfer_cifar10_softmax_b1.py

Parts of the code in this implementation have been borrowed from the Improved-GAN implementation by OpenAI (T. Salimans, I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, and X. Chen. Improved techniques for training gans. In Advances in Neural Information Processing Systems, pages 2226–2234, 2016.)

Cite

@inproceedings{DeLiGAN17,
  author = {Gurumurthy, Swaminathan and Sarvadevabhatla, Ravi Kiran and R. Venkatesh Babu},
  title = {DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data},
  booktitle = {Proceedings of the 2017 Conference on Computer Vision and Pattern Recognition},
  location = {Honolulu, Hawaii, USA}
 }

Q&A

Please send message to [email protected] if you have any query regarding the code.

DeLiGAN - This project is an implementation of the Generative Adversarial Network

Related tags

Overview

DeLiGAN

Dependencies

Datasets

Usage

Training DeLiGAN models

Modified inception score

Cite

Q&A

Owner

Video Analytics Lab -- IISc

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

The fastai book, published as Jupyter Notebooks

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

Unofficial keras(tensorflow) implementation of MAE model from Masked Autoencoders Are Scalable Vision Learners

source code of Adversarial Feedback Loop Paper

Deep Learning Interviews book: Hundreds of fully solved job interview questions from a wide range of key topics in AI.

meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

A small tool to joint picture including gif

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Code for the Paper "Diffusion Models for Handwriting Generation"

Video Autoencoder: self-supervised disentanglement of 3D structure and motion

Vehicle detection using machine learning and computer vision techniques for Udacity's Self-Driving Car Engineer Nanodegree.

Dense Prediction Transformers

The fastest way to visualize GradCAM with your Keras models.

Rethinking Transformer-based Set Prediction for Object Detection

Code for the Convolutional Vision Transformer (ConViT)

TensorFlow (Python) implementation of DeepTCN model for multivariate time series forecasting.

UNAVOIDS: Unsupervised and Nonparametric Approach for Visualizing Outliers and Invariant Detection Scoring