BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Last update: Dec 02, 2022

Overview

BitPack

BitPack is a practical tool that can efficiently save quantized neural network models with mixed bitwidth.

Installation

PyTorch version >= 1.4.0
Python version >= 3.5
To install Bitpack simply run:

git clone https://github.com/Zhen-Dong/BitPack.git
cd BitPack

Usage

We can use BitPack pack.py to save integer checkpoints with various bitwidth, and use BitPack unpack.py to load the packed checkpoint, as shown in the demo.
To pack integer values that are saved in floating point format, add --force-pack-fp in the command.
To directly save packed checkpoint in PyTorch, please use save_quantized_state_dict() and load_quantized_state_dict() in pytorch_interface.py. If you don't want to operate jointly on state_dict, then codes inside the for loop of those two functions can be applied on every quantized tensor (ultra low-precision integer tensors) in various quantization frameworks.

Quick Start

BitPack is handy to use on various quantization frameworks. Here we show a demo that applying BitPack to save mixed-precision model generated by HAWQ.

export CUDA_VISIBLE_DEVICES=0
python pack.py --input-int-file quantized_checkpoint.pth.tar --force-pack-fp
python unpack.py --input-packed-file packed_quantized_checkpoint.pth.tar --original-int-file quantized_checkpoint.pth.tar

To get a better sense of how BitPack works, we provide a simple test that compares the original tensor, the packed tensor, and the unpacked tensor in details.

cd bitpack
python bitpack_utils.py

Results of BitPack on ResNet50

Original Precision	Quantization	Original Size(MB)	Packed Size(MB)	Compression Ratio
Floating Point	Mixed-Precision(4bit/8bit)	102	13.8	7.4x
8-bit	Mixed-Precision(2bit/8bit)	26	7.9	3.3x

Special Notes

unpack.py can be used for checking correctness. It loads and unpacks the packed model, and then compares it with the original model.

License

BitPack is released under the MIT license.

BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Related tags

Overview

BitPack

Installation

Usage

Quick Start

Results of BitPack on ResNet50

Special Notes

License

Owner

Zhen Dong

Simple PyTorch implementations of Badnets on MNIST and CIFAR10.

On Generating Extended Summaries of Long Documents

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Implementation of the method described in the Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.

[NeurIPS 2021] Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training

Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Python version of the amazing Reaction Mechanism Generator (RMG).

Python implementation of Wu et al (2018)'s registration fusion

Blender Python - Node-based multi-line text and image flowchart

A script that trains a model to recognize handwritten digits using the MNIST data set.

Official implementation of DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations in TensorFlow 2

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

This is a library for training and applying sparse fine-tunings with torch and transformers.

Sketch-Based 3D Exploration with Stacked Generative Adversarial Networks

Collection of common code that's shared among different research projects in FAIR computer vision team.

A blender add-on that automatically re-aligns wrong axis objects.

A rule-based log analyzer & filter

An imperfect information game is a type of game with asymmetric information