Simple TensorFlow implementation of "Toward Spatially Unbiased Generative Models" (ICCV 2021)

Last update: Apr 15, 2022

Overview

Spatially unbiased GANs: simple TensorFlow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021) [Paper]

Abstract

Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We argue that the generators rely on their implicit positional encoding to render spatial content. From our observations, the generator’s implicit positional encoding is translation-variant, making the generator spatially biased. To address this issue, we propose injecting explicit positional encoding at each scale of the generator. By learning the spatially unbiased generator, we facilitate the robust use of generators in multiple tasks, such as GAN inversion, multi-scale generation, generation of arbitrary sizes and aspect ratios. Furthermore, we show that our method can also be applied to denoising diffusion probabilistic models.
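The idea of injecting an explicit positional encoding at each scale can be illustrated with a small sketch. This is not the code from this repository or the paper: the sinusoidal form, the function names `sinusoidal_encoding_2d` and `inject`, and the choice to add the encoding to the features are all assumptions made for illustration. It only shows the general pattern of building one encoding per resolution and combining it with a feature map of the same shape.

```python
import math


def sinusoidal_encoding_2d(height, width, channels):
    """Return a height x width grid of `channels`-dimensional position vectors.

    The first half of each vector encodes the row index, the second half the
    column index, each with sin/cos pairs at several frequencies.
    (Hypothetical helper; the encoding used by the paper may differ.)
    """
    if channels % 4 != 0:
        raise ValueError("channels must be divisible by 4")
    n_freq = channels // 4
    grid = []
    for y in range(height):
        row = []
        for x in range(width):
            vec = []
            for pos in (y, x):
                for k in range(n_freq):
                    freq = 1.0 / (10000 ** (k / n_freq))
                    vec.append(math.sin(pos * freq))
                    vec.append(math.cos(pos * freq))
            row.append(vec)
        grid.append(row)
    return grid


def inject(features, encoding):
    """Add an encoding to a feature map with the same H x W x C shape."""
    return [
        [[f + e for f, e in zip(fvec, evec)] for fvec, evec in zip(frow, erow)]
        for frow, erow in zip(features, encoding)
    ]


# One encoding per generator scale (resolutions chosen only for illustration).
scales = [4, 8, 16]
encodings = {s: sinusoidal_encoding_2d(s, s, channels=8) for s in scales}
```

Because the encoding is a function of the coordinates rather than a learned constant tied to one resolution, the same helper can produce grids for sizes and aspect ratios other than the ones listed here.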

Requirements

TensorFlow >= 2.0

Usage

├── dataset
   └── YOUR_DATASET_NAME
       ├── 000001.jpg 
       ├── 000002.png
       └── ...
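Before training, it can help to check that the folder layout above is in place. The following is a minimal sketch, assuming images are stored directly inside `dataset/YOUR_DATASET_NAME` with `.jpg`, `.jpeg`, or `.png` extensions; the helper `list_images` is hypothetical and is not part of this repository, whose own loader may behave differently.

```python
from pathlib import Path

# Extensions assumed from the example tree above (.jpg and .png), plus .jpeg.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def list_images(dataset_root, dataset_name):
    """Return the sorted image paths found directly under dataset_root/dataset_name."""
    folder = Path(dataset_root) / dataset_name
    if not folder.is_dir():
        raise FileNotFoundError(f"dataset folder not found: {folder}")
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )
```

For example, `len(list_images("dataset", "YOUR_DATASET_NAME"))` gives the number of images that such a loader would see.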

Train

> python main.py --dataset FFHQ --phase train --img_size 256 --batch_size 4 --n_total_image 6400

Generate Video

> python generate_video.py

Results

FID: 3.81 (6.4M images, i.e. 200k iterations on 8 GPUs with a batch size of 4 per GPU)
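The image count quoted with the FID follows from the iteration count, the number of GPUs, and the per-GPU batch size. The sketch below only reproduces that arithmetic; the function names are hypothetical and do not come from this repository.

```python
def images_seen(iterations, num_gpus, batch_per_gpu):
    """Total number of training images processed after `iterations` steps."""
    return iterations * num_gpus * batch_per_gpu


def iterations_needed(total_images, num_gpus, batch_per_gpu):
    """Number of steps needed to process `total_images`, rounded up."""
    global_batch = num_gpus * batch_per_gpu
    return -(-total_images // global_batch)  # ceiling division


# 200k iterations on 8 GPUs with 4 images per GPU.
total = images_seen(200_000, 8, 4)  # 6,400,000 images, i.e. 6.4M
```

If `--n_total_image` is expressed in thousands of images (an assumption; the unit is not stated here), the value 6400 in the training command would correspond to the same 6.4M images.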

Video

Uncurated

Style mixing

Style mixing quality is worse than with StyleGAN2.

Truncation trick
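For readers unfamiliar with the term, the truncation trick as commonly described for StyleGAN-style generators moves a latent code toward the average latent by a factor psi. The sketch below shows only that interpolation on plain Python lists; the name `truncate` is hypothetical, and whether this repository applies truncation per layer, or which psi values were used for its results, is not stated here.

```python
def truncate(w, w_avg, psi):
    """Interpolate a latent vector toward the average latent.

    psi = 1.0 leaves w unchanged; psi = 0.0 returns w_avg.
    Smaller psi typically trades sample diversity for fidelity.
    """
    if len(w) != len(w_avg):
        raise ValueError("w and w_avg must have the same length")
    return [avg + psi * (value - avg) for value, avg in zip(w, w_avg)]


# Halfway between the average latent and the sampled latent.
w_half = truncate([1.0, -2.0], [0.0, 0.0], psi=0.5)
```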

Reference

Author

Junho Kim

