GAN JAX - A toy project to generate images from GANs with JAX

Last update: Nov 29, 2022

Related tags

Overview

GAN JAX - A toy project to generate images from GANs with JAX

This project aims to bring the power of JAX, a Python framework developped by Google and DeepMind to train Generative Adversarial Networks for images generation.

JAX

JAX is a framework developed by Deep-Mind (Google) that allows to build machine learning models in a more powerful (XLA compilation) and flexible way than its counterpart Tensorflow, using a framework almost entirely based on the nd.array of numpy (but stored on the GPU, or TPU if available). It also provides new utilities for gradient computation (per sample, jacobian with backward propagation and forward-propagation, hessian...) as well as a better seed system (for reproducibility) and a tool to batch complicated operations automatically and efficiently.

Github link: https://github.com/google/jax

GAN

Generative adversarial networks (GANs) are algorithmic architectures that use two neural networks, pitting one against the other (thus the adversarial) in order to generate new, synthetic instances of data that can pass for real data. They are used widely in image generation, video generation and voice generation. GANs were introduced in a paper by Ian Goodfellow and other researchers at the University of Montreal, including Yoshua Bengio, in 2014. Referring to GANs, Facebook’s AI research director Yann LeCun called adversarial training the most interesting idea in the last 10 years in ML. (source)

Original paper: https://arxiv.org/abs/1406.2661

Some ideas have improved the training of the GANs by the years. For example:

Deep Convolution GAN (DCGAN) paper: https://arxiv.org/abs/1511.06434

Progressive Growing GAN (ProGAN) paper: https://arxiv.org/abs/1710.10196

The goal of this project is to implement these ideas in JAX framework.

Installation

You can install JAX following the instruction on JAX - Installation

It is strongly recommended to run JAX on Linux with CUDA available (Windows has no stable support yet). In this case you can install JAX using the following command:

pip install --upgrade "jax[cuda]" -f https://storage.googleapis.com/jax-releases/jax_releases.html

Then you can install Tensorflow to benefit from tf.data.Dataset to handle the data and the pre-installed dataset. However, Tensorfow allocate memory of the GPU on use (which is not optimal for running calculation with JAX). Therefore, you should install Tensorflow on the CPU instead of the GPU. Visit this site Tensorflow - Installation with pip to install the CPU-only version of Tensorflow 2 depending on your OS and your Python version.

Exemple with Linux and Python 3.9:

pip install tensorflow -f https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow_cpu-2.6.0-cp39-cp39-manylinux2010_x86_64.whl

Then you can install the other librairies from requirements.txt. It will install Haiku and Optax, two usefull add-on libraries to implement and optimize machine learning models with JAX.

pip install -r requirements.txt

Install CelebA dataset (optional)

To use the CelebA dataset, you need to download the dataset from Kaggle and install the images in the folder img_align_celeba/ in data/CelebA/images. It is recommended to download the dataset from this source because the faces are already cropped.

Note: the other datasets will be automatically installed with keras or tensorflow-datasets.

Quick Start

You can test a pretrained GAN model by using apps/test.py. It will download the model from pretrained models (in pre_trained/) and generate pictures. You can change the GAN to test by changing the path in the script.

You can also train your own GAN from scratch with apps/train.py. To change the parameters of the training, you can change the configs in the script. You can also change the dataset or the type of GAN by changing the imports (there is only one workd to change for each).

Example to train a GAN in celeba (64x64):

from utils.data import load_images_celeba_64 as load_images

To train a DCGAN:

from gan.dcgan import DCGAN as GAN

Then you can implement your own GAN and train/test them in your own dataset (by overriding the appropriate functions, check the examples in the repository).

Some results of pre-trained models

- Deep Convolution GAN

On MNIST:

On Cifar10:

On CelebA (64x64):

- Progressive Growing GAN

On MNIST:
On Cifar10:
On CelebA (64x64):
On CelebA (128x128):

GAN JAX - A toy project to generate images from GANs with JAX

Related tags

Overview

GAN JAX - A toy project to generate images from GANs with JAX

JAX

GAN

Installation

Install CelebA dataset (optional)

Quick Start

Some results of pre-trained models

- Deep Convolution GAN

- Progressive Growing GAN

Owner

Valentin Goldité

LaneAF: Robust Multi-Lane Detection with Affinity Fields

Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

Running AlphaFold2 (from ColabFold) in Azure Machine Learning

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

This repository contains several image-to-image translation models, whcih were tested for RGB to NIR image generation. The models are Pix2Pix, Pix2PixHD, CycleGAN and PointWise.

Automatically align face images 🙃→🙂. Can also do windowing and warping.

Contextual Attention Localization for Offline Handwritten Text Recognition

Least Square Calibration for Peer Reviews

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

MT-GAN-PyTorch - PyTorch Implementation of Learning to Transfer: Unsupervised Domain Translation via Meta-Learning

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Hardware-accelerated DNN model inference ROS2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

This repository contains the code for the paper Neural RGB-D Surface Reconstruction

A tight inclusion function for continuous collision detection

Companion code for the paper Theoretical characterization of uncertainty in high-dimensional linear classification

Final project for machine learning (CSC 590). Detection of hepatitis C and progression through blood samples.

SEOVER: Sentence-level Emotion Orientation Vector based Conversation Emotion Recognition Model

Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper