This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Last update: Jul 14, 2022

Related tags

Deep Learning L2ight

Overview

L2ight

By Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen and David Z. Pan.

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Introduction

L2ight is a closed-loop ONN on-chip learning framework to enable scalable ONN mapping and efficient in-situ learning. L2ight adopts a three-stage learning flow that first calibrates the complicated photonic circuit states under challenging physical constraints, then performs photonic core mapping via combined analytical solving and zeroth-order optimization. A subspace learning procedure with multi-level sparsity is integrated into L2ight to enable in-situ gradient evaluation and fast adaptation, unleashing the power of optics for real on-chip intelligence. L2ight outperforms prior ONN training protocols with 3-order-of-magnitude higher scalability and over 30X better efficiency, when benchmarked on various models and learning tasks. This synergistic framework is the first scalable on-chip learning solution that pushes this emerging field from intractable to scalable and further to efficient for next-generation self-learnable photonic neural chips.

Dependencies

Python >= 3.6
pyutils >= 0.0.1. See pyutils for installation.
pytorch-onn >= 0.0.1. See pytorch-onn for installation.
Python libraries listed in requirements.txt
NVIDIA GPUs and CUDA >= 10.2

Structures

core/
- models/
  - layers/
    - custom_conv2d and custom_linear layers
    - utils.py: sampler and profiler
  - sparse_bp_*.py: model definition
  - sparse_bp_base.py: base model definition; identity calibration and mapping codes.
- optimizer/: mixedtrain and flops optimizers
- builder.py: build training utilities
script/: contains experiment scripts
train_pretrain.py, train_map.py, train_learn.py, train_zo_learn.py: training logic
compare_gradient.py: compare approximated gradients with true gradients for ablation

Usage

Pretrain model.
> python3 train_pretrain.py config/cifar10/vgg8/pretrain.yml
Identity calibration and parallel mapping. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/pm/pm.yml and run
> python3 train_map.py CONFIG --checkpoint.restore_checkpoint=path/to/your/pretrained/checkpoint
Subspace learning with multi-level sampling. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/ds/learn.yml and run
> python3 train_learn.py CONFIG --checkpoint.restore_chekcpoint=path/to/your/mapped/checkpoint --checkpoint.resume=1
All scripts for experiments are in ./script. For example, to run subspace learning with feedback sampling, column sampling, and data sampling, you can write proper task setting in SCRIPT=script/vgg8/train_ds_script.py and run
> python3 SCRIPT
Comparison experiments with RAD [ICLR 2021] and SWAT-U [NeurIPS 2020]. Run with the SCRIPT=script/vgg8/train_rad_script.py and script/vgg8/train_swat_script.py,
> python3 SCRIPT
Comparison with FLOPS [DAC 2020] and MixedTrn [AAAI 2021]. Run with the METHOD=mixedtrain or flops,
> python3 train_zo_learn.py config/mnist/cnn3/METHOD/learn.yml

Citing L2ight

@inproceedings{gu2021L2ight,
  title={L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization},
  author={Jiaqi Gu and Hanqing Zhu and Chenghao Feng and Zixuan Jiang and Ray T. Chen and David Z. Pan},
  journal={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Related tags

Overview

L2ight

Introduction

Dependencies

Structures

Usage

Citing L2ight

Owner

Jiaqi Gu

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

A simplistic and efficient pure-python neural network library from Phys Whiz with CPU and GPU support.

Streaming over lightweight data transformations

How to Train a GAN? Tips and tricks to make GANs work

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

Official Implementation of VAT

Official implementation of "Learning Forward Dynamics Model and Informed Trajectory Sampler for Safe Quadruped Navigation" (RSS 2022)

Invariant Causal Prediction for Block MDPs

VideoGPT: Video Generation using VQ-VAE and Transformers

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings

A PyTorch Implementation of "SINE: Scalable Incomplete Network Embedding" (ICDM 2018).

Urban mobility simulations with Python3, RLlib (Deep Reinforcement Learning) and Mesa (Agent-based modeling)

Python package for Bayesian Machine Learning with scikit-learn API

Online-compatible Unsupervised Non-resonant Anomaly Detection Repository

TakeInfoatNistforICS - Take Information in NIST NVD for ICS

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

PyTorch implementation of Tacotron speech synthesis model.

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

A collection of resources on GAN Inversion.

RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering