Orthogonal Over-Parameterized Training

Last update: Apr 18, 2022

Overview

Orthogonal Over-Parameterized Training

By Weiyang Liu, Rongmei Lin, Zhen Liu, James Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller

License

OPT is released under the MIT License (refer to the LICENSE file for details).

Introduction
Citation
Short Video Introduction
Requirements
Usage

Introduction

The inductive bias of a neural network is largely determined by the architecture and the training algorithm. To achieve good generalization, how to effectively train a neural network is of great importance. We propose a novel orthogonal over-parameterized training (OPT) framework that can provably minimize the hyperspherical energy which characterizes the diversity of neurons on a hypersphere. See our previous work -- MHE for an in-depth introduction.

By maintaining the minimum hyperspherical energy during training, OPT can greatly improve the empirical generalization. Specifically, OPT fixes the randomly initialized weights of the neurons and learns an orthogonal transformation that applies to these neurons. We consider multiple ways to learn such an orthogonal transformation, including unrolling orthogonalization algorithms, applying orthogonal parameterization, and designing orthogonality-preserving gradient descent. For better scalability, we propose the stochastic OPT which performs orthogonal transformation stochastically for partial dimensions of neurons.

Our OPT is accepted to CVPR 2021 as oral presentation and the full paper is available on arXiv and here.

Citation

If you find our work useful in your research, please consider to cite:

@InProceedings{Liu2021OPT,
    title={Orthogonal Over-Parameterized Training},
    author={Liu, Weiyang and Lin, Rongmei and Liu, Zhen and Rehg, James M. and Paull, Liam 
     and Xiong, Li and Song, Le and Weller, Adrian},
    booktitle={CVPR},
    year={2021}
}

Short Video Introduction

We also provide a short video introduction to help interested readers quickly go over our work and understand the essence of OPT. Please click the following figure to watch the Youtube video.

Requirements

Python 3.7
TensorFlow 1.14.0

Usage

This repository provides both OPT and S-OPT implementations on CIFAR-100 as a demostration.

Part 1: Clone the repositary

git clone https://github.com/wy1iu/OPT.git

Part 2: Download the official CIFAR-100 training and testing data (python version)

wget https://www.cs.toronto.edu/~kriz/cifar-100-python.tar.gz

Part 3: Train and test with the following code in different folder.

# Run Cayley Parameterization OPT
cd opt_cp
python train.py

# Run Gram-Schmidt OPT
cd opt_gs
python train.py

# Run Householder Reflection OPT
cd opt_hr
python train.py

# Run Lowdin’s Symmetric OPT
cd opt_ls
python train.py

# Run Orthogonality-Preserving Gradient Descent OPT
cd opt_ogd
python train.py

# Run Stochastic OPT (Gram-Schmidt)
cd sopt_gs
python train.py

Orthogonal Over-Parameterized Training

Related tags

Overview

Orthogonal Over-Parameterized Training

License

Contents

Introduction

Citation

Short Video Introduction

Requirements

Usage

Part 1: Clone the repositary

Part 2: Download the official CIFAR-100 training and testing data (python version)

Part 3: Train and test with the following code in different folder.

Contact

Owner

Weiyang Liu

Malware Env for OpenAI Gym

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

Source code of the paper PatchGraph: In-hand tactile tracking with learned surface normals.

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

GAN-generated image detection based on CNNs

基于DouZero定制AI实战欢乐斗地主

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

3D-Transformer: Molecular Representation with Transformer in 3D Space

The world's simplest facial recognition api for Python and the command line

PyTorch version implementation of DORN

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

🤗 Push your spaCy pipelines to the Hugging Face Hub

Official PyTorch Implementation of GAN-Supervised Dense Visual Alignment

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

Exploiting Robust Unsupervised Video Person Re-identification

Deep Distributed Control of Port-Hamiltonian Systems

Neurolab is a simple and powerful Neural Network Library for Python

A curated list of awesome Machine Learning frameworks, libraries and software.

Orthogonal Over-Parameterized Training

Related tags

Overview

Orthogonal Over-Parameterized Training

License

Contents

Introduction

Citation

Short Video Introduction

Requirements

Usage

Part 1: Clone the repositary

Part 2: Download the official CIFAR-100 training and testing data (python version)

Part 3: Train and test with the following code in different folder.

Contact

Owner

Weiyang Liu

Malware Env for OpenAI Gym

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

Source code of the paper PatchGraph: In-hand tactile tracking with learned surface normals.

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

GAN-generated image detection based on CNNs

基于DouZero定制AI实战欢乐斗地主

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

3D-Transformer: Molecular Representation with Transformer in 3D Space

The world's simplest facial recognition api for Python and the command line

PyTorch version implementation of DORN

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

🤗 Push your spaCy pipelines to the Hugging Face Hub

Official PyTorch Implementation of GAN-Supervised Dense Visual Alignment

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

Exploiting Robust Unsupervised Video Person Re-identification

Deep Distributed Control of Port-Hamiltonian Systems

Neurolab is a simple and powerful Neural Network Library for Python

A curated list of awesome Machine Learning frameworks, libraries and software.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.