Generate Cartoon Images using Generative Adversarial Network

Last update: Dec 29, 2022

Overview

AvatarGAN ✨

Generate Cartoon Images using DC-GAN

Deep Convolutional GAN is a generative adversarial network architecture. It uses a couple of guidelines, in particular:

Replacing any pooling layers with strided convolutions (discriminator) and fractional-strided convolutions (generator).
Using batchnorm in both the generator and the discriminator.
Removing fully connected hidden layers for deeper architectures.
Using ReLU activation in generator for all layers except for the output, which uses tanh.
Using LeakyReLU activation in the discriminator for all layer.

Checkout the detailed explanation of AvatarGAN in the article AvatarGAN

GAN Model

Define Generator and Discriminator network architecture
Train the Generator model to generate the fake data that can fool Discriminator
Train the Discriminator model to distinguish real vs fake data
Continue the training for several epochs and save the Generator model

Dataset Setup

Cartoon Set which is a collection of random 2D cartoon avatar images. Download the dataset using the shell script.

sh download-dataset.sh

This will download the dataset in data/ directory. If you want to train the model in Google Colab, upload the dataset folder to Google Drive. The destination path should be projects/cartoons/.

Model Training

Check out the model being trained to generate cartoon images.

Generate Cartoon Images using Generative Adversarial Network

Related tags

Overview

AvatarGAN ✨

GAN Model

Dataset Setup

Model Training

Model Prediction

Owner

Aakash Jhawar

Code for: https://berkeleyautomation.github.io/bags/

Collection of generative models in Tensorflow

Release of the ConditionalQA dataset

Using the provided dataset which includes various book features, in order to predict the price of books, using various proposed methods and models.

Official implementation of Pixel-Level Bijective Matching for Video Object Segmentation

dualPC.R contains the R code for the main functions.

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)

Repo for the paper "DiLBERT: Cheap Embeddings for Disease Related Medical NLP"

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning.

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

Road Crack Detection Using Deep Learning Methods

Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)

Wordplay, an artificial Intelligence based crossword puzzle solver.

Scene-Text-Detection-and-Recognition (Pytorch)

Code for "Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo"

Plugin for Gaffer providing direct acess to asset from PolyHaven.com. Only HDRIs at the moment, Cycles and Arnold supported

Deep metric learning methods implemented in Chainer

Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.