A DCGAN to generate anime faces using a custom mined dataset

Last update: Jan 03, 2023

Overview

Anime-Face-GAN-Keras

A DCGAN to generate anime faces using a custom dataset in Keras.

Dataset

The dataset was built by crawling anime database websites with curl. The script anime_dataset_gen.py crawls the sites and processes the results into 64x64 PNG images cropped to show only the face.
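As a rough illustration of the cropping step, the sketch below turns a face bounding box into a square crop region that stays inside the image, so the later resize to 64x64 does not distort the face. It is not taken from anime_dataset_gen.py: the function name, the `margin` value, and the idea that a face detector supplies the box are all assumptions.

```python
def square_crop_box(box, image_size, margin=0.2):
    """Expand a face bounding box into a square crop clamped to the image.

    box: (left, top, right, bottom) in pixels, e.g. from a face detector.
    image_size: (width, height) of the source image.
    margin: extra fraction added to the larger box side (assumed value).
    """
    left, top, right, bottom = box
    width, height = image_size

    # Use the larger side so the whole face fits, then add the margin.
    side = max(right - left, bottom - top)
    side = int(round(side * (1 + margin)))
    side = min(side, width, height)  # the square cannot exceed the image

    # Centre the square on the face, then shift it back inside the image.
    cx = (left + right) // 2
    cy = (top + bottom) // 2
    new_left = min(max(cx - side // 2, 0), width - side)
    new_top = min(max(cy - side // 2, 0), height - side)
    return (new_left, new_top, new_left + side, new_top + side)


print(square_crop_box((100, 80, 180, 160), (400, 300)))
```

The returned tuple uses the (left, upper, right, lower) order that Pillow's `Image.crop` expects, so one plausible follow-up is `image.crop(box).resize((64, 64))` before saving as PNG.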

Examples of the dataset:

Network

This GAN implementation uses deconv (transposed convolution) layers in Keras (the networks are initialized in the GAN_Nets.py file). I also tried other layer combinations for upsampling:
-Conv + Upsampling
-Conv + bilinear
-Conv + Subpixel Upscaling
None of these combinations yielded decent results. The GAN either failed to generate images that resemble faces, or it generated the same or very similar faces for every batch (generator collapse, often called mode collapse). These are only my results; techniques such as mini-batch discrimination or z-layers might give better ones.
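To make the role of the deconv layers concrete, here is a small size calculation for a stack of transposed convolutions. The formula is the standard one for a transposed convolution along one axis; the 4x4 starting feature map and the kernel 4 / stride 2 / padding 1 settings are assumptions chosen for illustration and may not match GAN_Nets.py.

```python
def transposed_conv_output_size(size, kernel, stride, padding):
    """Output length of a transposed convolution along one spatial axis."""
    return (size - 1) * stride - 2 * padding + kernel


def upsampling_chain(start, layers, kernel=4, stride=2, padding=1):
    """Spatial size after each of `layers` stacked transposed convolutions."""
    sizes = [start]
    for _ in range(layers):
        sizes.append(
            transposed_conv_output_size(sizes[-1], kernel, stride, padding)
        )
    return sizes


# Four stride-2 layers take an assumed 4x4 feature map up to 64x64.
print(upsampling_chain(4, 4))  # [4, 8, 16, 32, 64]
```

In Keras, a `Conv2DTranspose` layer with `strides=2` and `padding="same"` doubles each spatial dimension in the same way, so four such layers would be one way to reach the 64x64 output size of the dataset.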

Training

Only simple GAN training methods are used. Training uses about 22,000 images. The images are not all loaded into memory; instead, each time a batch is sampled, only the sampled images are loaded. Each step does the following:
-Sample images from the dataset (real data)
-Generate images with the generator, using Gaussian noise as input (fake data)
-Add noise to the labels of the real and fake data
-Train the discriminator on the real data
-Train the discriminator on the fake data
-Train the GAN on fake images with real-data labels
Training runs for 10,000 steps. On my setup (GTX 660; i5 4670) each step takes 10-11 seconds, so a full run takes roughly 28 to 31 hours.
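The steps listed above can be sketched as one framework-agnostic function. This is only an outline under stated assumptions, not the repository's training code: the four callables (`load_images`, `generate`, `train_d`, `train_gan`) are hypothetical hooks, the latent size of 100 is assumed, and the uniform label noise of width 0.1 is one possible noise scheme since the source does not say which one it uses.

```python
import random


def sample_batch_paths(image_paths, batch_size, rng):
    """Pick a random batch of file paths; only these files get read from disk."""
    return rng.sample(image_paths, batch_size)


def noisy_labels(batch_size, real, rng, noise=0.1):
    """Labels near 1 for real data and near 0 for fake data (assumed scheme)."""
    if real:
        return [1.0 - rng.uniform(0.0, noise) for _ in range(batch_size)]
    return [rng.uniform(0.0, noise) for _ in range(batch_size)]


def train_step(image_paths, batch_size, load_images, generate,
               train_d, train_gan, rng, latent_dim=100):
    """Run one training step; the four callables are hypothetical hooks."""
    # Real data: load only the sampled images, never the whole dataset.
    real_images = load_images(sample_batch_paths(image_paths, batch_size, rng))

    # Fake data: generator output for Gaussian noise vectors.
    z = [[rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
         for _ in range(batch_size)]
    fake_images = generate(z)

    # Discriminator updates on real and fake batches with noisy labels.
    d_loss_real = train_d(real_images, noisy_labels(batch_size, True, rng))
    d_loss_fake = train_d(fake_images, noisy_labels(batch_size, False, rng))

    # Generator update through the stacked model, using "real" target labels.
    g_loss = train_gan(z, [1.0] * batch_size)
    return d_loss_real, d_loss_fake, g_loss
```

With Keras models, the hooks could plausibly map to `generator.predict`, `discriminator.train_on_batch`, and `gan.train_on_batch`, where `gan` is the generator stacked on a discriminator whose weights are frozen for that update.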

Loss plot:

Full training as a GIF (images sampled every 100 steps):

Faces generated at the end of 10,000 steps:

The faces look pretty good IMO. They might look more like actual faces with more training, more data, and probably a better network.

Resources

https://github.com/tdrussell/IllustrationGAN
https://github.com/jayleicn/animeGAN
https://github.com/forcecore/Keras-GAN-Animeface-Character

https://distill.pub/2016/deconv-checkerboard/
https://kivantium.net/keras-bilinear

Owner

Pavitrakumar P
