A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Last update: Jul 26, 2022

Overview

PokeGAN

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Dataset

The model has been trained on dataset that includes 819 pokémon.
You can download dataset from this kaggle link.

Dependencies

I have used the following versions for code work:

python==3.8.8
tensorflow==2.4.1
tensorflow-gpu==2.4.1
numpy==1.19.1
h5py==2.10.0

Note

There are several difficulties in pokemon generation using GAN :

The difficulty of GAN training is well known; changing a hyperparameter can greatly change the results.
The dataset size is too small! 819 different pokemon images are not enough. For this reason, I applied data augmentation on the data; these are the transformations applied :

img_transf = tf.keras.Sequential([
            	tf.keras.layers.experimental.preprocessing.RandomContrast(factor=(0.05, 0.15)),
                image_aug.RandomBrightness(brightness_delta=(-0.15, 0.15)),
                image_aug.PowerLawTransform(gamma=(0.8,1.2)),
                image_aug.RandomSaturation(sat=(0, 2)),
                image_aug.RandomHue(hue=(0, 0.15)),
                tf.keras.layers.experimental.preprocessing.RandomFlip("horizontal"),
	    	tf.keras.layers.experimental.preprocessing.RandomTranslation(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomZoom(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomRotation(factor=(-0.10, 0.10))])

StyleGAN training is very expensive! I trained the model starting from a 4x4 resolution up to the final resolution of 256x256. The model was trained for 8 days using a Tesla V100 32GB SXM2.
To get better results you need to use higher resolutions and train for longer time.

Results

These are some examples of new pokémon generated by the model :

New Generated Pokémon

More results

You can see hundreds of new pokemon here.
I repeat again it : to get better results (better details in pokemon) is necessary to train for more time.

References

This code implementation is inspired by the unofficial keras implementation of styleGAN.

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Related tags

Overview

PokeGAN

Dataset

Dependencies

Note

Results

More results

References

Owner

[제 13회 투빅스 컨퍼런스] OK Mugle! - 장르부터 멜로디까지, Content-based Music Recommendation

Fake videos detection by tracing the source using video hashing retrieval.

Semantic-aware Grad-GAN for Virtual-to-Real Urban Scene Adaption

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

Official codebase for ICLR oral paper Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

Implementation of the federated dual coordinate descent (FedDCD) method.

Bridging Composite and Real: Towards End-to-end Deep Image Matting

Python utility to generate filesystem content for Obsidian.

labelpix is a graphical image labeling interface for drawing bounding boxes

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features

This repo holds the code of TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation

Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

Deep Learning and Logical Reasoning from Data and Knowledge

The BCNet related data and inference model.

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Source code of CIKM2021 Long Paper "PSSL: Self-supervised Learning for Personalized Search with Contrastive Sampling".

Deep Learning tutorials in jupyter notebooks.

Using pytorch to implement unet network for liver image segmentation.