Create images and texts with the First Order Generative Adversarial Networks

Last update: Dec 11, 2021

Overview

First Order Divergence for training GANs

This repository contains code accompanying the paper First Order Generative Advesarial Netoworks

The majority of the code was copied from the repository https://github.com/bioinf-jku/TTUR

First Order Wasserstein Divergence GAN

The key added value of this code is its implementation two GANS that minimize not the KL-divergence or the WGAN-GP divergence, but the First Order Wasserstein Divergence, leading to better stability and perfomance.

Frechet Inception Distance (FID)

The FID is the performance measure used to evaluate the experiments in the paper. There, a detailed description can be found in the experiment section as well as in the the appendix in section A1.

In short: The Frechet distance between two multivariate Gaussians X_1 ~ N(mu_1, C_1) and X_2 ~ N(mu_2, C_2) is

                   d^2 = ||mu_1 - mu_2||^2 + Tr(C_1 + C_2 - 2*sqrt(C_1*C_2)).

The FID is calculated by assuming that X_1 and X_2 are the activations of the pool_3 layer of the inception model (see below) for generated samples and real world samples respectivly.

Compatibility notice

Previous versions of this repository contained two implementations to calculate the FID, a "unbatched" and a "batched" version. The "unbatched" version should not be used anymore. If you've downloaded this code previously, please update it immediately to the new version. The old version included a bug!

Provided Code

Requirements: TF 1.1, Python 3.x, for faster JSD estimation in language model, compile the language model code.

fid.py

This file contains the implementation of all necessary functions to calculate the FID. It can be used either as a python module imported into your own code, or as a standalone script to calculate the FID between precalculated (training set) statistics and a directory full of images, or between two directories of images.

To compare directories with pre-calculated statistics (e.g. the ones from http://bioinf.jku.at/research/ttur/), use:

fid.py /path/to/images /path/to/precalculated_stats.npz

To compare two directories, use

fid.py /path/to/images /path/to/other_images

See fid.py --help for more details.

fid_example.py

Example code to show the usage of fid.py in your own Python scripts.

precalc_stats_example.py

Example code to show how to calculate and save training set statistics.

WGAN_GP

Improved WGAN (WGAN-GP) implementation forked from https://github.com/igul222/improved_wgan_training with added FID evaluation for the image model and switchable TTUR/orig settings. Lanuage model with JSD Tensorboard logging and switchable TTUR/orig settings.

Precalculated Statistics for FID calculation

Precalculated statistics for datasets

cropped CelebA (calculated on all samples)
LSUN bedroom (calculated on all training samples)
CIFAR 10 (calculated on all training samples)
SVHN (calculated on all training samples)
ImageNet Train (calculated on all training samples)
ImageNet Valid (calculated on all validation samples)

are provided at: http://bioinf.jku.at/research/ttur/

Additional Links

For FID evaluation download the Inception modelf from http://download.tensorflow.org/models/image/imagenet/inception-2015-12-05.tgz

The cropped CelebA dataset can be downloaded here http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

To download the LSUN bedroom dataset go to: http://www.yf.io/p/lsun

The 64x64 downsampled ImageNet training and validation datasets can be found here http://image-net.org/small/download.php

Create images and texts with the First Order Generative Adversarial Networks

Related tags

Overview

First Order Divergence for training GANs

First Order Wasserstein Divergence GAN

Frechet Inception Distance (FID)

Compatibility notice

Provided Code

fid.py

fid_example.py

precalc_stats_example.py

WGAN_GP

Precalculated Statistics for FID calculation

Additional Links

Owner

Zalando Research

OpenLT: An open-source project for long-tail classification

Multimodal Descriptions of Social Concepts: Automatic Modeling and Detection of (Highly Abstract) Social Concepts evoked by Art Images

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

[NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”,

Train CNNs for the fruits360 data set in NTOU CS「Machine Vision」class.

Train the HRNet model on ImageNet

Code for: https://berkeleyautomation.github.io/bags/

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

Diabetes-Feature-Engineering - A machine learning model that can predict whether people have diabetes when their characteristics are specified

A Runtime method overload decorator which should behave like a compiled language

Official implementation for CVPR 2021 paper: Adaptive Class Suppression Loss for Long-Tail Object Detection

TensorFlow (Python) implementation of DeepTCN model for multivariate time series forecasting.

SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems

This is the source code of the solver used to compete in the International Timetabling Competition 2019.

KE-Dialogue: Injecting knowledge graph into a fully end-to-end dialogue system.

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

MagFace: A Universal Representation for Face Recognition and Quality Assessment