Labels4Free: Unsupervised Segmentation using StyleGAN

Last update: Dec 23, 2022

Related tags

Overview

Labels4Free: Unsupervised Segmentation using StyleGAN

ICCV 2021

Figure: Some segmentation masks predicted by Labels4Free Framework on real and synthetic images

We propose an unsupervised segmentation framework for StyleGAN generated objects. We build on two main observations. First, the features generated by StyleGAN hold valuable information that can be utilized towards training segmentation networks. Second, the foreground and background can often be treated to be largely independent and be swapped across images to produce plausible composited images. For our solution, we propose to augment the Style-GAN2 generator architecture with a segmentation branch and to split the generator into a foreground and background network. This enables us to generate soft segmentation masks for the foreground object in an unsupervised fashion. On multiple object classes, we report comparable results against state-of-the-art supervised segmentation networks, while against the best unsupervised segmentation approach we demonstrate a clear improvement, both in qualitative and quantitative metrics.

Labels4Free: Unsupervised Segmentation Using StyleGAN (ICCV 2021)
Rameen Abdal, Peihao Zhu, Niloy Mitra, Peter Wonka
KAUST, Adobe Research

[Paper] [Project Page] [Video]

Installation

Clone this repo.

git clone https://github.com/RameenAbdal/Labels4Free.git
cd Labels4Free/

This repo is based on the Pytorch implementation of StyleGAN2 (rosinality/stylegan2-pytorch). Refer to this repo for setting up the environment, preparation of LMDB datasets and downloading pretrained weights of the models.

Download the pretrained weights of Alpha Networks here

Training the models

The models were trained on 4 RTX 2080 (24 GB) GPUs. In order to train the models using the settings in the paper use the following commands for each dataset.

Checkpoints and samples are saved in ./checkpoint and ./sample folders.

FFHQ dataset

python -m torch.distributed.launch --nproc_per_node=4 train.py --size 1024 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [FFHQ_CONFIG-F_CHECKPOINT]--loss_multiplier 1.2 --iter 1200 --trunc 1.0 --lr 0.0002 --reproduce_model

LSUN-Horse dataset

python -m torch.distributed.launch --nproc_per_node=4 train.py --size 256 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_HORSE_CONFIG-F_CHECKPOINT] --loss_multiplier 3 --iter 500 --trunc 1.0 --lr 0.0002 --reproduce_model

LSUN-Cat dataset

python -m torch.distributed.launch --nproc_per_node=4 train.py --size 256 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_CAT_CONFIG-F_CHECKPOINT]  --loss_multiplier 3 --iter 900 --trunc 0.5 --lr 0.0002 --reproduce_model

LSUN-Car dataset

python train.py --size 512 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_CAR_CONFIG-F_CHECKPOINT] --loss_multiplier 10 --iter 50 --trunc 0.3 --lr 0.002 --sat_weight 1.0 --model_save_freq 25 --reproduce_model --use_disc

In order to train your own models using different settings e.g on a single GPU, using different samples, iterations etc. use the following commands.

FFHQ dataset

python train.py --size 1024 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [FFHQ_CONFIG-F_CHECKPOINT] --loss_multiplier 1.2 --iter 2000 --trunc 1.0 --lr 0.0002 --bg_coverage_wt 3 --bg_coverage_value 0.4

LSUN-Horse dataset

python train.py --size 256 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_HORSE_CONFIG-F_CHECKPOINT] --loss_multiplier 3 --iter 2000 --trunc 1.0 --lr 0.0002 --bg_coverage_wt 6 --bg_coverage_value 0.6

LSUN-Cat dataset

python train.py --size 256 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_CAT_CONFIG-F_CHECKPOINT] --loss_multiplier 3 --iter 2000 --trunc 0.5 --lr 0.0002 --bg_coverage_wt 4 --bg_coverage_value 0.35

LSUN-Car dataset

python train.py --size 512 [LMDB_DATASET_PATH] --batch 2 --n_sample 8 --ckpt [LSUN_CAR_CONFIG-F_CHECKPOINT] --loss_multiplier 20 --iter 750 --trunc 0.3 --lr 0.0008 --sat_weight 0.1 --bg_coverage_wt 40 --bg_coverage_value 0.75 --model_save_freq 50

Sample from the pretrained model

Samples are saved in ./test_sample folder.

python test_sample.py --size [SIZE] --batch 2 --n_sample 100 --ckpt_bg_extractor [ALPHANETWORK_MODEL] --ckpt_generator [GENERATOR_MODEL] --th 0.9

Results on Custom dataset

Folder: Custom dataset, predicted and ground truth masks.

python test_customdata.py --path_gt [GT_Folder] --path_pred [PRED_FOLDER]

Citation

@InProceedings{Abdal_2021_ICCV,
    author    = {Abdal, Rameen and Zhu, Peihao and Mitra, Niloy J. and Wonka, Peter},
    title     = {Labels4Free: Unsupervised Segmentation Using StyleGAN},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {13970-13979}
}

Acknowledgments

This implementation builds upon the Pytorch implementation of StyleGAN2 (rosinality/stylegan2-pytorch). This work was supported by Adobe Research and KAUST Office of Sponsored Research (OSR).

Labels4Free: Unsupervised Segmentation using StyleGAN

Related tags

Overview

Labels4Free: Unsupervised Segmentation using StyleGAN

ICCV 2021

Installation

Training the models

Sample from the pretrained model

Results on Custom dataset

Citation

Acknowledgments

Owner

prior-based-losses-for-medical-image-segmentation

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

[CVPR 2021] Forecasting the panoptic segmentation of future video frames

Code for "OctField: Hierarchical Implicit Functions for 3D Modeling (NeurIPS 2021)"

Python scripts for performing stereo depth estimation using the HITNET Tensorflow model.

High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)

source code the paper Fast and Robust Iterative Closet Point.

68 keypoint annotations for COFW test data

Learnable Boundary Guided Adversarial Training (ICCV2021)

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

Like Dirt-Samples, but cleaned up

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

alfred-py: A deep learning utility library for human

MogFace: Towards a Deeper Appreciation on Face Detection

Run containerized, rootless applications with podman

Evaluation framework for testing segmentation networks in PyTorch

Labels4Free: Unsupervised Segmentation using StyleGAN

Related tags

Overview

Labels4Free: Unsupervised Segmentation using StyleGAN

ICCV 2021

Installation

Training the models

Sample from the pretrained model

Results on Custom dataset

Citation

Acknowledgments

Owner

prior-based-losses-for-medical-image-segmentation

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

[CVPR 2021] Forecasting the panoptic segmentation of future video frames

Code for "OctField: Hierarchical Implicit Functions for 3D Modeling (NeurIPS 2021)"

Python scripts for performing stereo depth estimation using the HITNET Tensorflow model.

High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)

source code the paper Fast and Robust Iterative Closet Point.

68 keypoint annotations for COFW test data

Learnable Boundary Guided Adversarial Training (ICCV2021)

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

Like Dirt-Samples, but cleaned up

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.

alfred-py: A deep learning utility library for **human**

MogFace: Towards a Deeper Appreciation on Face Detection

Run containerized, rootless applications with podman

Evaluation framework for testing segmentation networks in PyTorch

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

alfred-py: A deep learning utility library for human