Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

Last update: Dec 30, 2022

This repo contains code for our OroJaR regularization, which encourages disentanglement in neural networks. It efficiently pushes the Jacobian vectors of your network's output with respect to each input dimension to be mutually orthogonal, so that changing one input dimension affects the output independently of the others.
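
As a rough guide to what "orthogonal Jacobian vectors" means, the objective can be written as below. This is a sketch of my reading of the description above, not a formula copied from the code: the symbols G (the network), z (its input with m dimensions), and the loss name are notation assumed here for illustration.

```latex
% Penalize the inner product between every pair of distinct Jacobian columns.
% G: network, z = (z_1, ..., z_m): input; notation assumed for illustration.
\mathcal{L}_{J}(G) \;=\; \sum_{i=1}^{m} \sum_{j \neq i}
  \left| \left[ \frac{\partial G}{\partial z_i} \right]^{\top}
         \frac{\partial G}{\partial z_j} \right|^{2}
```

The loss is zero exactly when every pair of distinct Jacobian columns is orthogonal, which is the condition the paragraph above describes.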

This repo contains the following:

Portable OroJaR implementations in both PyTorch and TensorFlow
Edges+Shoes and CLEVR ProGAN Experiments in TensorFlow
BigGAN Direction Discovery Experiments in PyTorch
Other Experiments in PyTorch

Adding the OroJaR to Your Code

We provide portable implementations of the OroJaR that you can easily add to your projects.

PyTorch: orojar_pytorch.py
TensorFlow: orojar_tf.py (needs pip install tensorflow-probability)

Adding the OroJaR to your own code is very simple:

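from orojar_pytorch import orojar

net = MyNeuralNet()        # your own network (placeholder name)
z = sample_input()         # a batch of inputs to regularize over (placeholder helper)
loss = orojar(G=net, z=z)  # OroJaR penalty for net at z
loss.backward()            # backpropagate it like any other loss term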
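
To build intuition for the quantity being penalized, here is a minimal pure-Python sketch. It is not the implementation in orojar_pytorch.py: it approximates each Jacobian column naively with central finite differences and sums the squared inner products between distinct columns. The function names and the two toy "generators" are hypothetical, chosen only for illustration.

```python
def jacobian_columns(g, z, eps=1e-5):
    """Approximate each Jacobian column dG/dz_i by central finite differences."""
    cols = []
    for i in range(len(z)):
        z_plus, z_minus = list(z), list(z)
        z_plus[i] += eps
        z_minus[i] -= eps
        out_plus, out_minus = g(z_plus), g(z_minus)
        cols.append([(p - m) / (2 * eps) for p, m in zip(out_plus, out_minus)])
    return cols


def orthogonality_penalty(g, z):
    """Sum of squared inner products between all pairs of distinct columns."""
    cols = jacobian_columns(g, z)
    penalty = 0.0
    for i, col_i in enumerate(cols):
        for j, col_j in enumerate(cols):
            if i != j:
                dot = sum(a * b for a, b in zip(col_i, col_j))
                penalty += dot ** 2
    return penalty


def entangled(z):
    # Hypothetical toy generator: both inputs change both outputs identically,
    # so the two Jacobian columns are parallel.
    return [z[0] + z[1], z[0] + z[1]]


def disentangled(z):
    # Hypothetical toy generator: each input controls exactly one output,
    # so the two Jacobian columns are orthogonal.
    return [z[0], z[1]]


if __name__ == "__main__":
    z = [0.3, -0.7]
    print("entangled penalty:   ", orthogonality_penalty(entangled, z))
    print("disentangled penalty:", orthogonality_penalty(disentangled, z))
```

The entangled toy model gets a clearly positive penalty while the disentangled one gets zero. A loop like this needs two forward passes per input dimension, so it is only practical for toy cases; for real networks, use the provided orojar function instead.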

Getting Started

This section and those below are needed only if you want to visualize, evaluate, or train with our code and models. To use the OroJaR in your own code, you can simply copy one of the files listed in the section above.

Both the TensorFlow and PyTorch codebases are tested with Linux on NVIDIA GPUs. You need at least Python 3.6. To get started, download this repo:

git clone https://github.com/csyxwei/OroJaR.git
cd OroJaR

Then, set up your environment. You can use the environment.yml file to create a Conda environment:

conda env create -f environment.yml
conda activate orojar

If you opt to use your own environment, we recommend TensorFlow 1.14.0 and PyTorch >= 1.6.0. Now you're all set up.

Citation

If our code aided your research, please cite our paper:

@inproceedings{wei2021orojar,
  title={Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation},
  author={Wei, Yuxiang and Shi, Yupeng and Liu, Xiao and Ji, Zhilong and Gao, Yuan and Wu, Zhongqin and Zuo, Wangmeng},
  booktitle={Proceedings of International Conference on Computer Vision (ICCV)},
  year={2021}
}

Owner

Yuxiang Wei

TJU Deep Learning & Neural Network
