Face2webtoon - Despite its importance, there are few previous works applying I2I translation to webtoon.

Last update: Oct 19, 2022

Face2webtoon

Introduction

Despite its importance, few previous works have applied image-to-image (I2I) translation to webtoons. I collected a dataset from the Naver Webtoon series 연애혁명 (Love Revolution) and tried to translate human faces into the webtoon domain.

Webtoon Dataset

I used an anime face detector to collect the webtoon faces. Because the detector is not very good at finding faces in webtoon images, I could gather only 1400 webtoon face images.

Baseline 0 (U-GAT-IT)

I used the official PyTorch implementation of U-GAT-IT, a GAN for unpaired image-to-image translation. With its CAM attention module and adaptive layer-instance normalization (AdaLIN), it performs well on translation tasks that require considerable shape deformation, across various hyperparameter settings. Since shapes differ greatly between the two domains, I chose this model.
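To make the normalization idea concrete, here is a minimal pure-Python sketch of AdaLIN on a single sample. It is not the official implementation: it uses one scalar rho and population variance for simplicity, and the function and argument names are my own. AdaLIN blends instance-normalized and layer-normalized activations with a weight rho, then applies a scale gamma and a shift beta.

```python
import math


def _mean_var(values):
    """Return the mean and population variance of a flat list of floats."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, var


def adalin(feature_map, gamma, beta, rho, eps=1e-5):
    """Apply AdaLIN to one sample shaped [channels][height][width].

    gamma, beta: per-channel scale and shift (lists of length `channels`).
    rho: mixing weight; 1 gives pure instance norm, 0 gives pure layer norm.
    """
    rho = min(1.0, max(0.0, rho))  # keep rho inside [0, 1]

    # Layer-norm statistics: computed over all channels and positions.
    all_values = [v for channel in feature_map for row in channel for v in row]
    mean_l, var_l = _mean_var(all_values)

    out = []
    for c, channel in enumerate(feature_map):
        # Instance-norm statistics: computed separately for each channel.
        mean_i, var_i = _mean_var([v for row in channel for v in row])
        new_channel = []
        for row in channel:
            new_row = []
            for v in row:
                a_in = (v - mean_i) / math.sqrt(var_i + eps)
                a_ln = (v - mean_l) / math.sqrt(var_l + eps)
                new_row.append(gamma[c] * (rho * a_in + (1 - rho) * a_ln) + beta[c])
            new_channel.append(new_row)
        out.append(new_channel)
    return out


sample = [[[1.0, 3.0]], [[5.0, 7.0]]]  # 2 channels, each 1x2
print(adalin(sample, gamma=[1.0, 1.0], beta=[0.0, 0.0], rho=0.5))
```

With rho near 1 each channel is normalized on its own, which tends to preserve content structure; with rho near 0 statistics are shared across channels, which allows larger changes in shape and style.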

For face data, I used the AFAD-Lite dataset from https://github.com/afad-dataset/tarball-lite.

Some results look pretty nice, but many results lost attributes during translation.

Missing Attributes

Gender

Gender information was lost.

Glasses

The model failed to generate glasses on the webtoon faces.

Result Analysis

To analyze the results, I separated the webtoon dataset into different groups.

group number	group name	number of data
0	woman_no_glasses	1050
1	man_no_glasses	249
2	man_glasses	17->49
3	woman_glasses	15->38

Even after I collected more data for groups 2 and 3, severe imbalances remain between the groups. As a result, the model failed to translate to the few-shot groups, namely groups 2 and 3.
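The size of the imbalance can be worked out directly from the table above. The short script below uses the counts after the extra collection (49 and 38 for the glasses groups); the variable names are only illustrative.

```python
# Group sizes after collecting extra images, taken from the table above.
group_sizes = {
    "woman_no_glasses": 1050,
    "man_no_glasses": 249,
    "man_glasses": 49,
    "woman_glasses": 38,
}

total = sum(group_sizes.values())
shares = {name: count / total for name, count in group_sizes.items()}
imbalance_ratio = max(group_sizes.values()) / min(group_sizes.values())

for name, share in shares.items():
    print(f"{name:>18}: {group_sizes[name]:>5} images ({share:.1%})")
print(f"largest / smallest group: {imbalance_ratio:.1f}x")
```

Group 0 accounts for roughly three quarters of the 1386 images, while the two glasses groups together make up only about 6%, and the largest group is about 27.6 times the size of the smallest.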

U-GAT-IT + Few Shot Transfer

Few shot transfer : https://arxiv.org/abs/2007.13332

Paper review : https://yun905.tistory.com/48

In this paper, the authors successfully transferred knowledge from a group with enough data to few-shot groups that have only 10 to 15 samples each. They first trained a basic model and then added branches for the few-shot groups.

Basic model

For the basic model, I trained U-GAT-IT on group 0 only.

Baseline 1 (simple fine-tuning)

For baseline 1, I froze the bottleneck layers of the generator and fine-tuned the basic model. I used 38 images (both real and fake) of groups 1, 2, and 3, and added 8 images of group 0 to prevent forgetting. I trained for 200k iterations.

The model mapped between groups at random.

Baseline 2 (group classification loss + selective backprop)

I attached an additional group classifier to the discriminator and added a group classification loss, following the original paper. Images from groups 0, 1, 2, and 3 were fed sequentially, and the bottleneck layers of the generator were updated for group 0 only.
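The training schedule described above can be sketched in plain Python. This is an illustration under assumptions, not the repository's code: the classification loss is assumed to be softmax cross-entropy (the exact form is not stated here), and the helper names `group_classification_loss` and `update_bottleneck` are hypothetical.

```python
import math
from itertools import cycle, islice


def group_classification_loss(logits, target_group):
    """Softmax cross-entropy for one image (assumed form of the group loss)."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target_group]


def update_bottleneck(group_id, base_group=0):
    """Selective backprop: only base-group batches update the bottleneck."""
    return group_id == base_group


# Groups are fed one after another: 0, 1, 2, 3, 0, 1, 2, 3, ...
schedule = list(islice(cycle([0, 1, 2, 3]), 8))
for step, group_id in enumerate(schedule):
    flag = "update" if update_bottleneck(group_id) else "freeze"
    print(f"step {step}: group {group_id}, bottleneck -> {flag}")
```

In a real training loop the flag would decide whether the bottleneck parameters receive gradients for that batch, so the shared representation is shaped only by the group with enough data.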

With limited data, the bias of the FID score is too large, so I used KID instead.

KID*1000
25.95
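For reference, KID is the unbiased estimate of the squared maximum mean discrepancy between real and generated features under a cubic polynomial kernel. The sketch below computes it in pure Python on arbitrary feature vectors. In practice the features come from an Inception network and the estimate is usually averaged over several subsets; both are omitted here, and the function names are my own.

```python
import random


def poly_kernel(x, y):
    """Cubic polynomial kernel used by KID: (x . y / d + 1) ** 3."""
    d = len(x)
    return (sum(a * b for a, b in zip(x, y)) / d + 1.0) ** 3


def kid(real_feats, fake_feats):
    """Unbiased MMD^2 estimate between two lists of feature vectors."""
    m, n = len(real_feats), len(fake_feats)
    # Within-set terms skip i == j, which is what makes the estimate unbiased.
    k_rr = sum(poly_kernel(real_feats[i], real_feats[j])
               for i in range(m) for j in range(m) if i != j) / (m * (m - 1))
    k_ff = sum(poly_kernel(fake_feats[i], fake_feats[j])
               for i in range(n) for j in range(n) if i != j) / (n * (n - 1))
    k_rf = sum(poly_kernel(x, y)
               for x in real_feats for y in fake_feats) / (m * n)
    return k_rr + k_ff - 2.0 * k_rf


def gaussian_features(rng, count, dim, mean):
    """Toy stand-in for Inception features."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(count)]


rng = random.Random(0)
real = gaussian_features(rng, 50, 4, mean=0.0)
fake_close = gaussian_features(rng, 50, 4, mean=0.0)
fake_far = gaussian_features(rng, 50, 4, mean=2.0)
print("KID*1000, same distribution:   ", 1000 * kid(real, fake_close))
print("KID*1000, shifted distribution:", 1000 * kid(real, fake_far))
```

Because the estimator is unbiased, it can come out slightly negative when the two sets are very similar, and it stays usable with the small sample sizes available here.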

U-GAT-IT + group classification loss + adaptive discriminator augmentation

ADA is a very useful data augmentation method for training GANs with limited data. Although the original paper only handles unconditional GANs, I applied ADA to U-GAT-IT, which is a conditional GAN. Augmentation was applied to both discriminators: preventing the face-domain discriminator from overfitting should improve the face generator, which in turn makes the cycle consistency loss more meaningful. Only pixel blitting and geometric transformations are implemented so far, since the paper reports that the other augmentations have minimal effect. The rest will be implemented later.

To achieve better results, I switched the face dataset to a more diverse one (CelebA).

ADA makes training take longer. Training ran for 8 days on a single 2070 SUPER but did not fully converge.
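The "adaptive" part of ADA is a small feedback loop on the augmentation probability p. The sketch below follows the heuristic from the ADA paper: r_t = E[sign(D(real))] is compared with a target of 0.6 every few minibatches, and p moves by a fixed step. How this repository computes the overfitting signal for U-GAT-IT's discriminators is not described here, so the function name, its arguments, and the default values should be read as assumptions.

```python
def sign(x):
    """Return -1, 0, or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)


def update_ada_p(p, real_logits, target=0.6, batch_size=1,
                 interval=4, ada_kimg=500):
    """Nudge the augmentation probability p so that r_t stays near `target`.

    real_logits: discriminator outputs on real images since the last update.
    interval:    number of minibatches between updates.
    ada_kimg:    thousands of images needed for p to move from 0 to 1.
    """
    # r_t near 1 means the discriminator is very confident on real images,
    # which the paper treats as a sign of overfitting.
    r_t = sum(sign(z) for z in real_logits) / len(real_logits)
    step = batch_size * interval / (ada_kimg * 1000)
    p = p + sign(r_t - target) * step
    return min(1.0, max(0.0, p)), r_t


p = 0.0
p, r_t = update_ada_p(p, real_logits=[1.0, 2.0, 0.5, 3.0], batch_size=8)
print(f"r_t = {r_t:.2f}, new p = {p:.6f}")
```

When the discriminator overfits, p rises and more real and generated images are augmented before the discriminator sees them; when r_t falls below the target, p is lowered again.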

KID*1000
12.14

Start training

python main.py --dataset dataset_name --useADA True --group 0,1,2,3 --use_grouploss True --neptune False

If --neptune is True, the experiment is logged to neptune.ai, an experiment management tool; you must set your API token. Passing --group 0,1,3 excludes group 2 from training.
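For readers who want to see how flags like these can be parsed, here is a hypothetical argparse sketch. It is not the repository's main.py: the defaults and the helper names `str2bool` and `parse_groups` are assumptions, and only the flag names come from the command above.

```python
import argparse


def str2bool(value):
    """Parse 'True'/'False' strings; bool('False') would be True in Python."""
    lowered = value.lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_groups(value):
    """Turn a string such as '0,1,3' into the list [0, 1, 3]."""
    return [int(g) for g in value.split(",") if g.strip()]


def build_parser():
    parser = argparse.ArgumentParser(description="Hypothetical CLI sketch")
    parser.add_argument("--dataset", type=str, required=True)
    parser.add_argument("--useADA", type=str2bool, default=False)
    parser.add_argument("--group", type=parse_groups, default=[0, 1, 2, 3])
    parser.add_argument("--use_grouploss", type=str2bool, default=False)
    parser.add_argument("--neptune", type=str2bool, default=False)
    return parser


args = build_parser().parse_args(
    ["--dataset", "dataset_name", "--useADA", "True",
     "--group", "0,1,3", "--neptune", "False"]
)
print(args.group)  # prints [0, 1, 3], so group 2 is left out
```

Parsing the booleans explicitly matters because passing the string "False" through plain `bool` would silently enable the option.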

Owner

이상윤
