Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Last update: Dec 30, 2022

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Pytorch implementation of Neuron Merging: Compensating for Pruned Neurons, accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020).

Requirements

To install requirements:

conda env create -f ./environment.yml

Python environment & main libraries:

python 3.8
pytorch 1.5.0
scikit-learn 0.22.1
torchvision 0.6.0

LeNet-300-100

To test LeNet-300-100 model on FashionMNIST, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script:

model type: original | prune | merge
pruning criterion : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

For example, to test the model after pruning 50% of the neurons with $l_1$-norm criterion, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t prune -c l1-norm -r 0.5

To test the model after merging , run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t merge -c l1-norm -r 0.5

VGG-16

To test VGG-16 model on CIFAR-10, run:

bash scripts/VGG16_CIFAR10.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

As a pretrained model on CIFAR-100 is not included, you must train it first. To train VGG-16 on CIFAR-100, run:

bash scripts/VGG16_CIFAR100_train.sh

All the hyperparameters are as described in the supplementary material.

After training, to test VGG-16 model on CIFAR-100, run:

bash scripts/VGG16_CIFAR100.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

ResNet

To test ResNet-56 model on CIFAR-10, run:

bash scripts/ResNet56_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

To test WideResNet-40-4 model on CIFAR-10, run:

bash scripts/WideResNet_40_4_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

Results

Our model achieves the following performance on (without fine-tuning) :

Image classification of LeNet-300-100 on FashionMNIST

Baseline Accuracy : 89.80%

Pruning Ratio	Prune ($l_1$-norm)	Merge
50%	88.40%	88.69%
60%	85.17%	86.92%
70%	71.26%	82.75%
80%	66.76	80.02%

Image classification of VGG-16 on CIFAR-10

Baseline Accuracy : 93.70%

Criterion	Prune	Merge
$l_1$-norm	88.70%	93.16%
$l_2$-norm	89.14%	93.16%
$l_2$-GM	87.85%	93.10%

Citation

@inproceedings{kim2020merging,
  title     = {Neuron Merging: Compensating for Pruned Neurons},
  author    = {Kim, Woojeong and Kim, Suhyun and Park, Mincheol and Jeon, Geonseok},
  booktitle = {Advances in Neural Information Processing Systems 33},
  year      = {2020}
}

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Requirements

LeNet-300-100

VGG-16

ResNet

Results

Image classification of LeNet-300-100 on FashionMNIST

Image classification of VGG-16 on CIFAR-10

Citation

Owner

Woojeong Kim

Gesture Volume Control Using OpenCV and MediaPipe

LibFewShot: A Comprehensive Library for Few-shot Learning.

Generate fine-tuning samples & Fine-tuning the model & Generate samples by transferring Note On

Code of the paper "Multi-Task Meta-Learning Modification with Stochastic Approximation".

Fast (simple) spectral synthesis and emission-line fitting of DESI spectra.

An Implementation of SiameseRPN with Feature Pyramid Networks

This is a deep learning-based method to segment deep brain structures and a brain mask from T1 weighted MRI.

PyTorch code for JEREX: Joint Entity-Level Relation Extractor

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

Implements pytorch code for the Accelerated SGD algorithm.

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

The code used for the free [email protected] Webinar series on Reinforcement Learning in Finance

Single-Shot Motion Completion with Transformer

Artificial Intelligence playing minesweeper 🤖

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more