Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Last update: Dec 01, 2022

Overview

Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Bo Li, Qiulin Wang, Jiquan Pei, Yu Yang, Xiangyang Ji

Abstract: The semantically disentangled latent subspace in GAN provides rich interpretable controls in image generation. This paper includes two contributions on semantic latent subspace analysis in the scenario of face generation using StyleGAN2. First, we propose a novel approach to disentangle latent subspace semantics by exploiting existing face analysis models, e.g., face parsers and face landmark detectors. These models provide the flexibility to construct various criterions with very concrete and interpretable semantic meanings (e.g., change face shape or change skin color) to restrict latent subspace disentanglement. Rich latent space controls unknown previously can be discovered using the constructed criterions. Second, we propose a new perspective to explain the behavior of a CNN classifier by generating counterfactuals in the interpretable latent subspaces we discovered. This explanation helps reveal whether the classifier learns semantics as intended. Experiments on various disentanglement criterions demonstrate the effectiveness of our approach. We believe this approach contributes to both areas of image manipulation and counterfactual explainability of CNNs.

The code is developed on NVlabs/stylegan2-ada-pytorch and put in the ice folder. Please play with the two ipython notebooks.

ice/discover_subspaces

Solve subspaces by using face analysis models as criterions. Currently we only include several representative subspaces. The notebook requires to download some pre-trained models. You might have to spend some efforts to put everything at the right place. See the notebook comments for details. This notebook shows the code sketch to generate Figure 3 (as below) in the paper, i.e., the latent subspace for interpretable face manipulation.

ice/explain_counterfactually

Use the interpretable subspaces discovered by the above notebook to explain the classifier of attractiveness. This notebook shows the code sketch to generate Figure 4 (as below) in the paper, i.e., the interpretable counterfactuals to increase attractiveness score of a given classifier. Since we did not find good public pre-trained model. The attractiveness classifier is trained by ourselves using d-li14/face-attribute-prediction.

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Related tags

Overview

Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN

Owner

Bo Li

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

Small utility to demangle Nim symbols in callgrind files

FeTaQA: Free-form Table Question Answering

Code for How To Create A Fully Automated AI Based Trading System With Python

Activity image-based video retrieval

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

A keras-based real-time model for medical image segmentation (CFPNet-M)

Convert openmmlab (not only mmdetection) series model to tensorrt

Voice control for Garry's Mod

CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution

A model to classify a piece of news as REAL or FAKE

OCR Streamlit App is used to extract text from images using python's easyocr, pytorch and streamlit packages

This is the open-source reference implementation of the SIGGRAPH 2021 paper Intersection-free Rigid Body Dynamics.

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

The implemetation of Dynamic Nerual Garments proposed in Siggraph Asia 2021

Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

Code for "My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack" paper

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

PyTorch evaluation code for Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.