Group Fisher Pruning for Practical Network Compression(ICML2021)

Last update: Dec 13, 2022

Overview

Group Fisher Pruning for Practical Network Compression (ICML2021)

By Liyang Liu*, Shilong Zhang*, Zhanghui Kuang, Jing-Hao Xue, Aojun Zhou, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Updates

All one stage models of Detection has been released (21/6/2021)

NOTES

All models about detection has been released. The classification models will be released later, because we want to refactor all our code into a Hook , so that it can become a more general tool for all tasks in OpenMMLab.

We will continue to improve this method and apply it to more other tasks, such as segmentation and pose.

The layer grouping algorithm is implemtated based on the AutoGrad of Pytorch, If you are not familiar with this feature and you can read Chinese, then these materials may be helpful to you.

Introduction

1. Compare with state-of-the-arts.

2. Can be applied to various complicated structures and various tasks.

3. Boosting inference speed on GPU under same flops.

Get Started

1. Creat a basic environment with pytorch 1.3.0 and mmcv-full

Due to the frequent changes of the autograd interface, we only guarantee the code works well in `pytorch==1.3.0`.

Creat the environment

conda create -n open-mmlab python=3.7 -y
conda activate open-mmlab

Install PyTorch 1.3.0 and corresponding torchvision.

conda install pytorch=1.3.0 cudatoolkit=10.0 torchvision=0.2.2 -c pytorch

Build the mmcv-full from source with pytorch 1.3.0 and cuda 10.0

Please use gcc-5.4 and nvcc 10.0

 git clone https://github.com/open-mmlab/mmcv.git
 cd mmcv
 MMCV_WITH_OPS=1 pip install -e .

2. Install the corresponding codebase in OpenMMLab.

e.g. MMdetection

pip install mmdet==2.13.0

3. Pruning the model.

e.g. Detection

cd detection

Modify the load_from as the path to the baseline model in of xxxx_pruning.py

# for slurm train
sh tools/slurm_train.sh PATITION_NAME JOB_NAME configs/retina/retina_pruning.py work_dir
# for slurm_test
sh tools/slurm_test.sh PATITION_NAME JOB_NAME configs/retina/retina_pruning.py PATH_CKPT --eval bbox
# for torch.dist
# sh tools/dist_train.sh configs/retina/retina_pruning.py 8

4. Finetune the model.

e.g. Detection

cd detection

Modify the deploy_from as the path to the pruned model in custom_hooks of xxxx_finetune.py

# for slurm train
sh tools/slurm_train.sh PATITION_NAME JOB_NAME configs/retina/retina_finetune.py work_dir
# for slurm test
sh tools/slurm_test.sh PATITION_NAME JOB_NAME configs/retina/retina_fintune.py PATH_CKPT --eval bbox
# for torch.dist
# sh tools/dist_train.sh configs/retina/retina_finetune.py 8

Models

Detection

Method	Backbone	Baseline(mAP)	Finetuned(mAP)	Download
RetinaNet	R-50-FPN	36.5	36.5	Baseline/Pruned/Finetuned
ATSS*	R-50-FPN	38.1	37.9	Baseline/Pruned/Finetuned
PAA*	R-50-FPN	39.0	39.4	Baseline/Pruned/Finetuned
FSAF	R-50-FPN	37.4	37.4	Baseline/Pruned/Finetuned

* indicate with no Group Normalization in heads.

Classification

Coming soon.

Please cite our paper in your publications if it helps your research.

@InProceedings{liu2021group,
  title = {Group Fisher Pruning for Practical Network Compression},
  author =       {Liu, Liyang and Zhang, Shilong and Kuang, Zhanghui and Zhou, Aojun and Xue, Jing-Hao and Wang, Xinjiang and Chen, Yimin and Yang, Wenming and Liao, Qingmin and Zhang, Wayne},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  year = {2021},
  series = {Proceedings of Machine Learning Research},
  month = {18--24 Jul},
  publisher ={PMLR},
}

Group Fisher Pruning for Practical Network Compression(ICML2021)

Related tags

Overview

Group Fisher Pruning for Practical Network Compression (ICML2021)

Updates

NOTES

Introduction

1. Compare with state-of-the-arts.

2. Can be applied to various complicated structures and various tasks.

3. Boosting inference speed on GPU under same flops.

Get Started

1. Creat a basic environment with pytorch 1.3.0 and mmcv-full

Due to the frequent changes of the autograd interface, we only guarantee the code works well in pytorch==1.3.0.

Please use gcc-5.4 and nvcc 10.0

2. Install the corresponding codebase in OpenMMLab.

3. Pruning the model.

4. Finetune the model.

Models

Detection

Classification

Please cite our paper in your publications if it helps your research.

Owner

Shilong Zhang

Reinforcement Learning for finance

Collection of common code that's shared among different research projects in FAIR computer vision team.

My implementation of transformers related papers for computer vision in pytorch

Balancing Principle for Unsupervised Domain Adaptation

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Computer Vision Paper Reviews with Key Summary of paper, End to End Code Practice and Jupyter Notebook converted papers

Robustness via Cross-Domain Ensembles

Code for our TKDE paper "Understanding WeChat User Preferences and “Wow” Diffusion"

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment

Deep learning with TensorFlow and earth observation data.

MRQy is a quality assurance and checking tool for quantitative assessment of magnetic resonance imaging (MRI) data.

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution

Code for technical report "An Improved Baseline for Sentence-level Relation Extraction".

This is an official implementation of our CVPR 2021 paper "Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression" (https://arxiv.org/abs/2104.02300)

Monitora la qualità della ricezione dei segnali radio nelle province siciliane.

PyTorch code for the paper "Curriculum Graph Co-Teaching for Multi-target Domain Adaptation" (CVPR2021)

Baseline and template code for node21 detection track

Due to the frequent changes of the autograd interface, we only guarantee the code works well in `pytorch==1.3.0`.