Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Last update: Dec 27, 2022

Related tags

Deep Learning InfoPro-Pytorch

Overview

InfoPro-Pytorch

The Information Propagation algorithm for training deep networks with local supervision.

(ICLR 2021) Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

Update on 2021/01/25: Release Pre-trained models on ImageNet and Cityscapes.

Update on 2021/01/24: Release Code for Image Classification on CIFAR/SVHN/STL10/ImageNet and Semantic Segmentation on Cityscapes.

Introduction

We propose Information Propagation (InfoPro), a locally supervised deep learning algorithm, from the information-theoretic perspective. By splitting the whole deep network into multiple local modules and training them with local InfoPro loss, we reduce the GPU memory footprint by 40-60% without introducing notable extra computational cost or training time, but improve the performance moderately.

Citation

If you find this work valuable or use our code in your own research, please consider citing us with the following bibtex:

@inproceedings{wang2021revisiting,
        title = {Revisiting Locally Supervised Learning: an Alternative to End-to-end Training},
       author = {Yulin Wang and Zanlin Ni and Shiji Song and Le Yang and Gao Huang},
    booktitle = {International Conference on Learning Representations (ICLR)},
         year = {2021},
          url = {https://openreview.net/forum?id=fAbkE6ant2}
}

Get Started

Please go to the folder Experiments on CIFAR-SVHN-STL10, Experiments on ImageNet and Semantic segmentation for specific docs.

Results

CIFAR & STL-10

ImageNet

Semantic Segmentation

GPU Memory Cost

In the paper, we report the minimally required GPU memory to run the InfoPro* algorithm with torch.backends.cudnn.benchmark=True (for practical acceleration). Note that this result is (sometimes largely) different from what is printed by nvidia-smi.

Contact

This repo is a re-implementation of our original code. If you have any question, please feel free to contact the authors. Yulin Wang: [email protected].

Acknowledgments

Our code of Semantic Segmentation is from MMSegmentation. We highly appreciate their awesome work!

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Related tags

Overview

InfoPro-Pytorch

Introduction

Citation

Get Started

Results

GPU Memory Cost

Contact

Acknowledgments

Owner

PConv-Keras - Unofficial implementation of "Image Inpainting for Irregular Holes Using Partial Convolutions". Try at: www.fixmyphoto.ai

Efficient neural networks for analog audio effect modeling

Cross View SLAM

TensorFlow port of PyTorch Image Models (timm) - image models with pretrained weights.

[ICCV 2021] Deep Hough Voting for Robust Global Registration

SMD-Nets: Stereo Mixture Density Networks

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning

A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

Final project code: Implementing BicycleGAN, for CIS680 FA21 at University of Pennsylvania

Code Impementation for "Mold into a Graph: Efficient Bayesian Optimization over Mixed Spaces"

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Official implementation of "Learning Proposals for Practical Energy-Based Regression", 2021.

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition

DexterRedTool - Dexter's Red Team Tool that creates cronjob/task scheduler to consistently creates users

Code release for ICCV 2021 paper "Anticipative Video Transformer"

Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization

Neon: an add-on for Lightbulb making it easier to handle component interactions

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)