Toward Spatially Unbiased Generative Models (ICCV 2021)

Last update: Dec 01, 2022

Related tags

Overview

Toward Spatially Unbiased Generative Models

Implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Overview

Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We argue that the generators rely on their implicit positional encoding to render spatial content. From our observations, the generator’s implicit positional encoding is translation-variant, making the generator spatially biased. To address this issue, we propose injecting explicit positional encoding at each scale of the generator. By learning the spatially unbiased generator, we facilitate the robust use of generators in multiple tasks, such as GAN inversion, multi-scale generation, generation of arbitrary sizes and aspect ratios. Furthermore, we show that our method can also be applied to denoising diffusion probabilistic models.

Generation

Due to spatial bias, samples of the original GAN are either destructive or stuck to the center when generating from a shifted location.

Original StyleGAN2	MS-PE + StyleGAN2

GAN Inversion

Input	Original StyleGAN2	MS-PE + StyleGAN2

Requirements

I have tested on:

PyTorch 1.7

Usage

Dataset

Create lmdb datasets:

python prepare_data.py --out LMDB_PATH --n_worker N_WORKER --size SIZE1,SIZE2,SIZE3,... DATASET_PATH

This will convert images to jpeg and pre-resizes it. This implementation does not use progressive growing, but you can create multiple resolution datasets using size arguments with comma separated lists, for the cases that you want to try another resolutions later.

Training

python train.py --name EXPERIMENT_NAME --path LMDB_PATH --position mspe

Set position to "none" for original StyleGAN2.

Generation

python generate.py --name EXPERIMENT_NAME --ckpt 550000.pt --truncation 1.0 --position mspe

GAN inversion

python projector.py --name EXPERIMENT_NAME --w_plus --ckpt 550000.pt --position mspe ref_face/00006.png

Notice

Because the current FFHQ dataset is tightly cropped, we used circular translation for proof-of-concept. Therefore, our samples show reflection artifacts at the boundaries. We are looking forward to training on FFHQ-U from alias-free GAN (https://arxiv.org/abs/2106.12423).

Acknowledgement

This code rely heavily on: https://github.com/rosinality/stylegan2-pytorch

Toward Spatially Unbiased Generative Models (ICCV 2021)

Related tags

Overview

Toward Spatially Unbiased Generative Models

Overview

Generation

GAN Inversion

Requirements

Usage

Dataset

Training

Generation

GAN inversion

Notice

Acknowledgement

Owner

Jooyoung Choi

Framework for estimating the structures and parameters of Bayesian networks (DAGs) at per-sample resolution

Data cleaning, missing value handle, EDA use in this project

PIKA: a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

A developer interface for creating Chat AIs for the Chai app.

Implementation for paper LadderNet: Multi-path networks based on U-Net for medical image segmentation

This Jupyter notebook shows one way to implement a simple first-order low-pass filter on sampled data in discrete time.

Process JSON files for neural recording sessions using Medtronic's BrainSense Percept PC neurostimulator

Source code of SIGIR2021 Paper 'One Chatbot Per Person: Creating Personalized Chatbots based on Implicit Profiles'

Codes for the ICCV'21 paper "FREE: Feature Refinement for Generalized Zero-Shot Learning"

Codes for the paper Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing

This is RFA-Toolbox, a simple and easy-to-use library that allows you to optimize your neural network architectures using receptive field analysis (RFA) and create graph visualizations of your architecture.

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

Wordle-solver - Wordle answer generation program in python

Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

magiCARP: Contrastive Authoring+Reviewing Pretraining

TrackFormer: Multi-Object Tracking with Transformers

Automatic Attendance marker for LMS Practice School Division, BITS Pilani

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets