Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

Last update: Dec 30, 2022

This repo contains code for our OroJaR regularization, which encourages disentanglement in neural networks. It efficiently optimizes the Jacobian vectors of your neural network with respect to each input dimension to be orthogonal, leading to disentangled results.
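To make the idea of orthogonal Jacobian vectors concrete, here is a minimal framework-free sketch. It is not the code in orojar_pytorch.py: it approximates the Jacobian of a toy function by central finite differences and sums the squared inner products between the Jacobian columns of different input dimensions. The function names and the two toy "generators" are hypothetical and exist only for illustration.

```python
# Illustrative toy only; assumes a generator that maps a list of floats
# to a list of floats. This is NOT the implementation shipped in this repo.

def jacobian_columns(g, z, eps=1e-5):
    """Approximate the Jacobian of g at z with central differences.

    Returns one column per input dimension: columns[i][d] ~ d g_d / d z_i.
    """
    columns = []
    for i in range(len(z)):
        z_plus, z_minus = list(z), list(z)
        z_plus[i] += eps
        z_minus[i] -= eps
        out_plus, out_minus = g(z_plus), g(z_minus)
        columns.append([(p - m) / (2 * eps) for p, m in zip(out_plus, out_minus)])
    return columns


def orthogonality_penalty(g, z, eps=1e-5):
    """Sum of squared inner products between distinct Jacobian columns."""
    cols = jacobian_columns(g, z, eps)
    penalty = 0.0
    for i in range(len(cols)):
        for j in range(len(cols)):
            if i != j:
                dot = sum(a * b for a, b in zip(cols[i], cols[j]))
                penalty += dot * dot
    return penalty


def disentangled(z):
    # Hypothetical toy: each latent dimension drives its own output coordinate.
    return [2.0 * z[0], 3.0 * z[1]]


def entangled(z):
    # Hypothetical toy: both latent dimensions change both outputs identically.
    return [z[0] + z[1], z[0] + z[1]]


z = [0.3, -0.7]
print(orthogonality_penalty(disentangled, z))  # approximately 0
print(orthogonality_penalty(entangled, z))     # approximately 8
```

The penalty vanishes when each input dimension changes the output along its own direction and grows when two dimensions change it along the same direction. The portable implementations in this repo are the ones to use in practice; the sketch only shows what quantity such a regularizer drives toward zero.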

This repo contains the following:

Portable OroJaR implementations in both PyTorch and TensorFlow
Edges+Shoes and CLEVR ProGAN Experiments in TensorFlow
BigGAN Direction Discovery Experiments in PyTorch
Other Experiments in PyTorch

Adding the OroJaR to Your Code

We provide portable implementations of the OroJaR that you can easily add to your projects.

PyTorch: orojar_pytorch.py
TensorFlow: orojar_tf.py (needs pip install tensorflow-probability)

Adding the OroJaR to your own code is very simple:

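from orojar_pytorch import orojar

# MyNeuralNet and sample_input are placeholders for your own model and input sampler.
net = MyNeuralNet()
z = sample_input()
# Compute the OroJaR loss of net with respect to the input z.
loss = orojar(G=net, z=z)
loss.backward()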

Getting Started

This section and those below are needed only if you want to visualize, evaluate, or train with our code and models. To use the OroJaR in your own code, you can copy one of the files mentioned in the section above.

Both the TensorFlow and PyTorch codebases are tested with Linux on NVIDIA GPUs. You need at least Python 3.6. To get started, download this repo:

git clone https://github.com/csyxwei/OroJaR.git
cd OroJaR

Then, set up your environment. You can use the environment.yml file to create a Conda environment:

conda env create -f environment.yml
conda activate orojar

If you opt to use your own environment, we recommend using TensorFlow 1.14.0 and PyTorch >= 1.6.0. Now you're all set up.
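If you manage your own environment, a quick check along the following lines can confirm that the interpreter and PyTorch meet the versions mentioned above. This is a hedged sketch, not a script from this repo: the helper names version_tuple and meets_minimum are hypothetical, and the parsing assumes version strings such as "1.7.1+cu110".

```python
import re
import sys


def version_tuple(version):
    """Turn a string like '1.7.1+cu110' into (1, 7, 1), dropping any suffix."""
    numbers = []
    for part in version.split("+")[0].split("."):
        match = re.match(r"\d+", part)
        if match is None:
            break
        numbers.append(int(match.group()))
    # Pad so that '1.6' and '1.6.0' compare as equal.
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def meets_minimum(version, minimum):
    """Return True if version is at least minimum."""
    return version_tuple(version) >= version_tuple(minimum)


print("Python >= 3.6:", sys.version_info >= (3, 6))
try:
    import torch
    print("PyTorch >= 1.6.0:", meets_minimum(torch.__version__, "1.6.0"))
except ImportError:
    print("PyTorch is not installed in this environment")
```

The same helper can be pointed at tensorflow.__version__ to compare against the recommended 1.14.0 if you use the TensorFlow codebase.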

Citation

If our code aided your research, please cite our paper:

@inproceedings{wei2021orojar,
  title={Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation},
  author={Wei, Yuxiang and Shi, Yupeng and Liu, Xiao and Ji, Zhilong and Gao, Yuan and Wu, Zhongqin and Zuo, Wangmeng},
  booktitle={Proceedings of International Conference on Computer Vision (ICCV)},
  year={2021}
}

Owner

Yuxiang Wei
