Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch

Last update: Dec 22, 2022

Overview

disclaimer: this code is modified from pytorch-tutorial

Image classification with synthetic gradient in Pytorch

I implement the Decoupled Neural Interfaces using Synthetic Gradients in pytorch. The paper uses synthetic gradient to decouple the layers among the network, which is pretty interesting since we won't suffer from update lock anymore. I test my model in mnist and almost the same performance, compared to the model updated with backpropagation.

Requirement

pytorch
python 3.5
torchvision
seaborn (optional)
matplotlib (optional)

TODO

use multi-threading on gpu to analyze the speed

What's synthetic gradients?

We ofter optimize NN by backpropogation, which is usually implemented in some well-known framework. However, is there another way for the layers in NN to communicate with other layers? Here comes the synthetic gradients! It gives us a way to allow neural networks to communicate, to learn to send messages between themselves, in a decoupled, scalable manner paving the way for multiple neural networks to communicate with each other or improving the long term temporal dependency of recurrent networks.
The neuron in each layer will automatically produces an error signal(δa_head) from synthetic-layers and do the optimzation. And how did the error signal generated? Actually, the network still does the backpropogation. While the error signal(δa) from the objective function is not used to optimize the neuron in the network, it is used to optimize the error signal(δa_head) produced by the synthetic-layer. The following is the illustration from the paper:

Result

Feed-Forward Network

Achieve accuracy=96% (compared to the original model, which with accuracy=97%)

classify loss	gradient loss(log level)

cDNI classify loss	cDNI gradient loss(log level)

Convolutional Neural Network

Achieve accuracy=96%, (compared to the original model, which with accuracy=98%)

classify loss	gradient loss(log level)

Usage

Right now I just implement the FCN, CNN versions, which are set as the default network structure.

Run network with synthetic gradient:

python main.py --model_type mlp

python main.py --model_type cnn

Run network with conditioned synthetic gradient:

python main.py --model_type mlp --conditioned True

Run vanilla network, from pytorch-tutorial

python mlp.py

python cnn.py

Reference

Deepmind's post on Decoupled Neural Interfaces Using Synthetic Gradients
Decoupled Neural Interfaces using Synthetic Gradients
Understanding Synthetic Gradients and Decoupled Neural Interfaces

Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch

Related tags

Overview

Image classification with synthetic gradient in Pytorch

Requirement

TODO

What's synthetic gradients?

Result

Feed-Forward Network

Convolutional Neural Network

Usage

Run network with synthetic gradient:

Run network with conditioned synthetic gradient:

Run vanilla network, from pytorch-tutorial

Reference

Owner

Andrew

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

Official implementation of "Robust channel-wise illumination estimation"

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-Pixel Part Segmentation [3DV 2021 Oral]

Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020)

Transfer Learning for Pose Estimation of Illustrated Characters

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

A user-friendly research and development tool built to standardize RL competency assessment for custom agents and environments.

NOD: Taking a Closer Look at Detection under Extreme Low-Light Conditions with Night Object Detection Dataset

SingleVC performs any-to-one VC, which is an important component of MediumVC project.

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Unsupervised Pre-training for Person Re-identification (LUPerson)

Winners of the Facebook Image Similarity Challenge

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

Conditional Generative Adversarial Networks (CGAN) for Mobility Data Fusion

GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

A Temporal Extension Library for PyTorch Geometric

CarND-LaneLines-P1 - Lane Finding Project for Self-Driving Car ND

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

Face Recognition & AI Based Smart Attendance Monitoring System.