Group Fisher Pruning for Practical Network Compression (ICML 2021)

Last update: Dec 13, 2022

Overview

By Liyang Liu*, Shilong Zhang*, Zhanghui Kuang, Jing-Hao Xue, Aojun Zhou, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Updates

All one-stage detection models have been released (21/6/2021).

NOTES

All detection models have been released. The classification models will be released later, because we want to refactor all our code into a Hook so that it can become a more general tool for all tasks in OpenMMLab.

We will continue to improve this method and apply it to more tasks, such as segmentation and pose estimation.

The layer grouping algorithm is implemented on top of PyTorch's autograd. If you are not familiar with this feature and can read Chinese, these materials may be helpful to you.
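As a rough illustration of what layer grouping has to achieve, the sketch below uses a hand-written toy graph rather than the real PyTorch autograd graph. It assumes that convolutions whose outputs meet at an element-wise addition (for example in a residual block) must be pruned with a shared channel mask, and it groups them with union-find. The layer names and the `add_nodes` structure are hypothetical and are not taken from this repository.

```python
# Toy illustration only: groups layers whose outputs are summed together.
# The real implementation walks the PyTorch autograd graph instead.


def find(parent, x):
    """Return the representative of x, shortening the path on the way."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def group_layers(layers, add_nodes):
    """Group layers that feed the same element-wise addition.

    layers:    list of layer names (hypothetical)
    add_nodes: list of tuples, each naming the layers summed at one add
    """
    parent = {name: name for name in layers}
    for inputs in add_nodes:
        root = find(parent, inputs[0])
        for other in inputs[1:]:
            parent[find(parent, other)] = root
    groups = {}
    for name in layers:
        groups.setdefault(find(parent, name), []).append(name)
    return sorted(sorted(g) for g in groups.values())


layers = ["stem", "block1.conv3", "block1.downsample", "block2.conv1", "block2.conv3"]
# block1 adds conv3 to the downsample branch; block2 adds conv3 to block1's output.
add_nodes = [("block1.conv3", "block1.downsample"), ("block2.conv3", "block1.conv3")]
groups = group_layers(layers, add_nodes)
print(groups)
```

Layers that end up in the same group would share one pruning decision per channel, while `stem` and `block2.conv1` stay independent in this toy example.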

Introduction

1. Comparison with state-of-the-art methods.

2. Can be applied to various complicated structures and various tasks.

3. Faster inference on GPU at the same FLOPs.

Get Started

1. Create a basic environment with PyTorch 1.3.0 and mmcv-full

Because the autograd interface changes frequently, we only guarantee that the code works with `pytorch==1.3.0`.

Create the environment

conda create -n open-mmlab python=3.7 -y
conda activate open-mmlab

Install PyTorch 1.3.0 and corresponding torchvision.

conda install pytorch=1.3.0 cudatoolkit=10.0 torchvision=0.2.2 -c pytorch

Build mmcv-full from source with PyTorch 1.3.0 and CUDA 10.0.

Please use gcc 5.4 and nvcc 10.0.

git clone https://github.com/open-mmlab/mmcv.git
cd mmcv
MMCV_WITH_OPS=1 pip install -e .
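After the build, it may be worth confirming that the interpreter actually sees the pinned versions. The following is a minimal sketch, not part of this repository: `version_matches` and `check_environment` are hypothetical helper names, and the check assumes that a full mmcv build exposes the `mmcv.ops` module.

```python
import re


def version_matches(installed, required):
    """True if `installed` equals `required` after dropping build suffixes.

    For example "1.3.0+cu100" is treated as "1.3.0".
    """
    core = re.split(r"[+-]", installed, maxsplit=1)[0]
    return core == required


def check_environment(required_torch="1.3.0"):
    """Return a list of human-readable problems; an empty list means none found."""
    try:
        import torch
    except ImportError:
        return ["PyTorch is not installed"]
    problems = []
    if not version_matches(torch.__version__, required_torch):
        problems.append(
            "PyTorch %s found, but %s is expected" % (torch.__version__, required_torch)
        )
    try:
        import mmcv.ops  # noqa: F401  (assumed to exist only in a full build)
    except ImportError:
        problems.append("mmcv with compiled ops (mmcv-full) is not importable")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["Environment looks consistent"]:
        print(line)
```

Running the script inside the `open-mmlab` conda environment prints one line per detected problem.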

2. Install the corresponding codebase in OpenMMLab.

e.g. MMDetection

pip install mmdet==2.13.0

3. Prune the model.

e.g. Detection

cd detection

Set load_from in xxxx_pruning.py to the path of the baseline model.
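Since OpenMMLab configs are plain Python files, the change amounts to a single assignment. The excerpt below is a hypothetical fragment: only the `load_from` key comes from this README, and the checkpoint path is a placeholder to replace with your own.

```python
# Hypothetical excerpt of a pruning config such as configs/retina/retina_pruning.py.
# Only `load_from` is named in this README; the path below is a placeholder.
load_from = "path/to/baseline_model.pth"  # checkpoint of the unpruned baseline
```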

# for slurm train
sh tools/slurm_train.sh PARTITION_NAME JOB_NAME configs/retina/retina_pruning.py work_dir
# for slurm test
sh tools/slurm_test.sh PARTITION_NAME JOB_NAME configs/retina/retina_pruning.py PATH_CKPT --eval bbox
# for torch.dist
# sh tools/dist_train.sh configs/retina/retina_pruning.py 8

4. Finetune the model.

e.g. Detection

cd detection

Set deploy_from in the custom_hooks of xxxx_finetune.py to the path of the pruned model.
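A hedged sketch of what that entry might look like is given below. The README only states that `deploy_from` sits inside `custom_hooks`; the hook type name `FisherPruningHook` and the checkpoint path are assumptions, so check the shipped config for the actual keys.

```python
# Hypothetical excerpt of a finetune config such as configs/retina/retina_finetune.py.
# The hook type name and the path are assumptions, not confirmed by this README.
custom_hooks = [
    dict(
        type="FisherPruningHook",  # assumed hook name
        deploy_from="path/to/pruned_model.pth",  # checkpoint produced by the pruning step
    )
]
```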

# for slurm train
sh tools/slurm_train.sh PARTITION_NAME JOB_NAME configs/retina/retina_finetune.py work_dir
# for slurm test
sh tools/slurm_test.sh PARTITION_NAME JOB_NAME configs/retina/retina_finetune.py PATH_CKPT --eval bbox
# for torch.dist
# sh tools/dist_train.sh configs/retina/retina_finetune.py 8

Models

Detection

Method	Backbone	Baseline(mAP)	Finetuned(mAP)	Download
RetinaNet	R-50-FPN	36.5	36.5	Baseline/Pruned/Finetuned
ATSS*	R-50-FPN	38.1	37.9	Baseline/Pruned/Finetuned
PAA*	R-50-FPN	39.0	39.4	Baseline/Pruned/Finetuned
FSAF	R-50-FPN	37.4	37.4	Baseline/Pruned/Finetuned

* indicates models without Group Normalization in the heads.
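To read the table as the accuracy change after pruning and finetuning, the finetuned mAP can be compared with the baseline. The snippet below only redoes that subtraction on the four rows above; it adds no new measurements.

```python
# (method, baseline mAP, finetuned mAP) copied from the detection table
results = [
    ("RetinaNet", 36.5, 36.5),
    ("ATSS", 38.1, 37.9),
    ("PAA", 39.0, 39.4),
    ("FSAF", 37.4, 37.4),
]

# Round to one decimal to hide floating-point noise in the subtraction.
deltas = {name: round(finetuned - baseline, 1) for name, baseline, finetuned in results}

for name, delta in deltas.items():
    print("%-10s %+.1f mAP" % (name, delta))
```

By this arithmetic, RetinaNet and FSAF keep their baseline mAP, ATSS loses 0.2 and PAA gains 0.4.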

Classification

Coming soon.

Please cite our paper in your publications if it helps your research.

@InProceedings{liu2021group,
  title = {Group Fisher Pruning for Practical Network Compression},
  author = {Liu, Liyang and Zhang, Shilong and Kuang, Zhanghui and Zhou, Aojun and Xue, Jing-Hao and Wang, Xinjiang and Chen, Yimin and Yang, Wenming and Liao, Qingmin and Zhang, Wayne},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  year = {2021},
  series = {Proceedings of Machine Learning Research},
  month = {18--24 Jul},
  publisher = {PMLR},
}


Owner

Shilong Zhang
