Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

Last update: Dec 29, 2022

Overview

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation
_{Official PyTorch implementation of the NeurIPS 2021 paper}

Mingcong Liu, Qiang Li, Zekui Qin, Guoxin Zhang, Pengfei Wan, Wen Zheng

Y-tech, Kuaishou Technology

Project page | Paper

Abstract: Generative Adversarial Networks (GANs) have made a dramatic leap in high-fidelity image synthesis and stylized face generation. Recently, a layer-swapping mechanism has been developed to improve the stylization performance. However, this method is incapable of fitting arbitrary styles in a single model and requires hundreds of style-consistent training images for each style. To address the above issues, we propose BlendGAN for arbitrary stylized face generation by leveraging a flexible blending strategy and a generic artistic dataset. Specifically, we first train a self-supervised style encoder on the generic artistic dataset to extract the representations of arbitrary styles. In addition, a weighted blending module (WBM) is proposed to blend face and style representations implicitly and control the arbitrary stylization effect. By doing so, BlendGAN can gracefully fit arbitrary styles in a unified model while avoiding case-by-case preparation of style-consistent training images. To this end, we also present a novel large-scale artistic face dataset AAHQ. Extensive experiments demonstrate that BlendGAN outperforms state-of-the-art methods in terms of visual quality and style diversity for both latent-guided and reference-guided stylized face synthesis.

Updates

✔️ (2021-11-19) Inference code and pretrained models have been released!

Pre-trained Models

You can download the following pretrained models to ./pretrained_models:

Model	Discription
blendgan	BlendGAN model (together with style_encoder)
psp_encoder	PSP Encoder model
style_encoder	Individual Style Encoder model (optional)

Inference

1. Generate image pairs with random face codes

for latent-guided generation, run:

python generate_image_pairs.py --size 1024 --pics N_PICS --ckpt ./pretrained_models/blendgan.pt --outdir results/generated_pairs/latent_guided/

for reference-guided generation, run:

python generate_image_pairs.py --size 1024 --pics N_PICS --ckpt ./pretrained_models/blendgan.pt --style_img ./test_imgs/style_imgs/100036.png --outdir results/generated_pairs/reference_guided/

2. Style tranfer with given face images

python style_transfer_folder.py --size 1024 --ckpt ./pretrained_models/blendgan.pt --psp_encoder_ckpt ./pretrained_models/psp_encoder.pt --style_img_path ./test_imgs/style_imgs/ --input_img_path ./test_imgs/face_imgs/ --outdir results/style_transfer/

3. Generate interpolation videos

python gen_video.py --size 1024 --ckpt ./pretrained_models/blendgan.pt --psp_encoder_ckpt ./pretrained_models/psp_encoder.pt --style_img_path ./test_imgs/style_imgs/ --input_img_path ./test_imgs/face_imgs/ --outdir results/inter_videos/

Bibtex

If you use this code for your research, please cite our paper:

@inproceedings{liu2021blendgan,
    title = {BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation},
    author = {Liu, Mingcong and Li, Qiang and Qin, Zekui and Zhang, Guoxin and Wan, Pengfei and Zheng, Wen},
    booktitle = {Advances in Neural Information Processing Systems},
    year = {2021}
}

Credits

StyleGAN2 model and implementation:
https://github.com/rosinality/stylegan2-pytorch
Copyright (c) 2019 Kim Seonghyeon
License (MIT) https://github.com/rosinality/stylegan2-pytorch/blob/master/LICENSE

IR-SE50 model and implementations:
https://github.com/TreB1eN/InsightFace_Pytorch
Copyright (c) 2018 TreB1eN
License (MIT) https://github.com/TreB1eN/InsightFace_Pytorch/blob/master/LICENSE

pSp model and implementation:
https://github.com/eladrich/pixel2style2pixel
Copyright (c) 2020 Elad Richardson, Yuval Alaluf
License (MIT) https://github.com/eladrich/pixel2style2pixel/blob/master/LICENSE

Please Note:

The CUDA files under the StyleGAN2 ops directory are made available under the Nvidia Source Code License-NC
The face images under the test_imgs directory are selected from the FFHQ dataset, which is made available under Creative Commons BY-NC-SA 4.0 license by NVIDIA Corporation.
The artistic images under the test_imgs directory are collected from Artstation, and the copyright remains with the original owners.

Acknowledgements

We sincerely thank all the reviewers for their comments. We also thank Zhenyu Guo for help in preparing the comparison to StarGANv2. This code borrows heavily from the pytorch re-implementation of StyleGAN2 by rosinality.

Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

Related tags

Overview

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation
_{Official PyTorch implementation of the NeurIPS 2021 paper}

Project page | Paper

Updates

Pre-trained Models

Inference

1. Generate image pairs with random face codes

2. Style tranfer with given face images

3. Generate interpolation videos

Bibtex

Credits

Acknowledgements

Owner

onion

Flax is a neural network ecosystem for JAX that is designed for flexibility.

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Trustworthy AI related projects

Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · R. Huang

Official PyTorch implementation of "VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization" (CVPR 2021)

Adversarial-autoencoders - Tensorflow implementation of Adversarial Autoencoders

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

A supplementary code for Editable Neural Networks, an ICLR 2020 submission.

training script for space time memory network

Listing arxiv - Personalized list of today's articles from ArXiv

PyTorch Code for "Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning"

PSTR: End-to-End One-Step Person Search With Transformers (CVPR2022)

Code for "Multi-Time Attention Networks for Irregularly Sampled Time Series", ICLR 2021.

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

Unsupervised Feature Ranking via Attribute Networks.

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

Related tags

Overview

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation Official PyTorch implementation of the NeurIPS 2021 paper

Project page | Paper

Updates

Pre-trained Models

Inference

1. Generate image pairs with random face codes

2. Style tranfer with given face images

3. Generate interpolation videos

Bibtex

Credits

Acknowledgements

Owner

onion

Flax is a neural network ecosystem for JAX that is designed for flexibility.

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Trustworthy AI related projects

Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · R. Huang

Official PyTorch implementation of "VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization" (CVPR 2021)

Adversarial-autoencoders - Tensorflow implementation of Adversarial Autoencoders

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

A supplementary code for Editable Neural Networks, an ICLR 2020 submission.

training script for space time memory network

Listing arxiv - Personalized list of today's articles from ArXiv

PyTorch Code for "Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning"

PSTR: End-to-End One-Step Person Search With Transformers (CVPR2022)

Code for "Multi-Time Attention Networks for Irregularly Sampled Time Series", ICLR 2021.

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

Unsupervised Feature Ranking via Attribute Networks.

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation
_{Official PyTorch implementation of the NeurIPS 2021 paper}