PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning"

Last update: Dec 08, 2022

Overview

An Information-Theoretic Justification for Model Pruning

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning".

An Information-Theoretic Justification for Model Pruning
Berivan Isik, Tsachy Weissman, Albert No
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.

1) Train the baseline model:

To train the baseline model to be compressed, set trainer=Classifier. To try this for ResNet-20, run:

python3 main.py --trainer=Classifier --config=cifar_resnet20/config.yaml

To test the baseline model, run:

python3 main.py --trainer=Classifier --config=cifar_resnet20/config.yaml --test

2) One-shot (non-iterative) reconstruction with SuRP:

To compress the baseline model with SuRP non-iteratively, change the experiment id exp_id of the target model and target sparsity ratio sparsity: [sparsity of the input model, target sparsity] in the recon.yaml file accordingly. Then, run:

python3 main.py --trainer=Reconstruction --config=cifar_resnet20/recon.yaml

3) Iterative reconstruction with SuRP:

To compress the baseline model with SuRP iteratively, apply SuRP several times following a sparsity schedule. Each time, modify exp_id and sparsity: [sparsity of the input model, target sparsity], accordingly. To retrain the sparse models before applying SuRP again, set retrain: True. And run:

python3 main.py --trainer=ReconFromFile --config=cifar_resnet20/recon.yaml

References

If you find this work useful in your research, please consider citing our paper:

@article{isik2021rate,
  title={Rate-Distortion Theoretic Model Compression: Successive Refinement for Pruning},
  author={Isik, Berivan and No, Albert and Weissman, Tsachy},
  journal={arXiv preprint arXiv:2102.08329},
  year={2021}
}

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning"

Related tags

Overview

An Information-Theoretic Justification for Model Pruning

1) Train the baseline model:

2) One-shot (non-iterative) reconstruction with SuRP:

3) Iterative reconstruction with SuRP:

References

Owner

Berivan Isik

Code for Paper Predicting Osteoarthritis Progression via Unsupervised Adversarial Representation Learning

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

TensorFlow tutorials and best practices.

2.86% and 15.85% on CIFAR-10 and CIFAR-100

Learning Intents behind Interactions with Knowledge Graph for Recommendation, WWW2021

The ARCA23K baseline system

Pytorch implementation of our method for regularizing nerual radiance fields for few-shot neural volume rendering.

Robocop is your personal mini voice assistant made using Python.

Development kit for MIT Scene Parsing Benchmark

Official code repository for the EMNLP 2021 paper

Final report with code for KAIST Course KSE 801.

Cortex-compatible model server for Python and TensorFlow

This Jupyter notebook shows one way to implement a simple first-order low-pass filter on sampled data in discrete time.

LaneDet is an open source lane detection toolbox based on PyTorch that aims to pull together a wide variety of state-of-the-art lane detection models

本步态识别系统主要基于GaitSet模型进行实现

PyTorch implementation of PSPNet segmentation network

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes

Semantic-aware Grad-GAN for Virtual-to-Real Urban Scene Adaption

Fashion Landmark Estimation with HRNet

DynaTune: Dynamic Tensor Program Optimization in Deep Neural Network Compilation