# MQBench Quantization Aware Training with PyTorch
I am using [MQBench (Model Quantization Benchmark)](http://mqbench.tech/) to quantize the model for deployment. MQBench is a benchmark and framework for evaluating quantization algorithms under real-world hardware deployments.
## Prerequisites
- Python 3.7+
- PyTorch 1.8.1+
## Install MQBench Lib
Before running this repository, you should install MQBench:
```bash
git clone https://github.com/ModelTC/MQBench.git
cd MQBench
python setup.py build
python setup.py install
```
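A quick way to confirm the install worked is to import the MQBench entry points used later in this README (a minimal sanity check, assuming the standard MQBench package layout):

```python
# Minimal sanity check: these imports are MQBench's QAT entry points.
from mqbench.prepare_by_platform import prepare_by_platform, BackendType
from mqbench.convert_deploy import convert_deploy

# Listing the supported backends confirms the package is importable.
print("Available backends:", [b.name for b in BackendType])
```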
## Training Fp32 Model
```bash
# Start training the fp32 model with
# (model_name can be ResNet18, MobileNet, ...):
python main.py model_name

# You can manually configure the training with:
python main.py --resume --lr=0.01
```
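For reference, the fp32 stage is ordinary supervised training. The sketch below shows the kind of loop `main.py` runs; the dataset, model choice, and hyperparameters here are illustrative assumptions, not the repository's exact code:

```python
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as T

# Illustrative assumptions: CIFAR-10 data, a torchvision ResNet18, SGD with lr=0.01.
device = "cuda" if torch.cuda.is_available() else "cpu"
train_set = torchvision.datasets.CIFAR10(
    root="./data", train=True, download=True, transform=T.ToTensor())
loader = torch.utils.data.DataLoader(train_set, batch_size=128, shuffle=True)

model = torchvision.models.resnet18(num_classes=10).to(device)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

model.train()
for inputs, targets in loader:  # one epoch; wrap in an epoch loop as needed
    inputs, targets = inputs.to(device), targets.to(device)
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    optimizer.step()
```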
## Training Quantized Model
```bash
# Start training the quantized model with
# (model_name can be ResNet18, MobileNet, ...):
python main.py model_name --quantize

# You can manually configure the training with:
python main.py --resume --parallel DP --BackendType Tensorrt --quantize
python -m torch.distributed.launch main.py --local_rank 0 --parallel DDP --resume --BackendType Tensorrt --quantize
```
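Under the hood, the `--quantize` path follows MQBench's prepare, calibrate, train, deploy workflow. Below is a minimal sketch based on MQBench's documented API; the backend choice, input shape, and the placeholder calibration forward are assumptions to keep the example self-contained:

```python
import torch
import torchvision

from mqbench.prepare_by_platform import prepare_by_platform, BackendType
from mqbench.utils.state import enable_calibration, enable_quantization
from mqbench.convert_deploy import convert_deploy

model = torchvision.models.resnet18(num_classes=10)
model.train()

# Trace the model and insert fake-quantize nodes for the target backend.
model = prepare_by_platform(model, BackendType.Tensorrt)

# 1) Calibration: observers collect activation statistics, weights stay fp32.
enable_calibration(model)
with torch.no_grad():
    model(torch.randn(1, 3, 32, 32))  # placeholder for a real calibration loop

# 2) QAT: fake quantization is now active; run the normal training loop here.
enable_quantization(model)

# 3) Export a deployable model for the chosen backend.
convert_deploy(model, BackendType.Tensorrt, {"x": [1, 3, 32, 32]})
```

`convert_deploy` removes the fake-quantize nodes and dumps the quantization parameters (such as clip ranges) that the backend needs.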
## Fp32 Accuracy
| Model | Acc. |
|---|---|
| VGG16 | 92.64% |
| ResNet18 | 93.02% |
| ResNet50 | 93.62% |
| ResNet101 | 93.75% |
| RegNetX_200MF | 94.24% |
| RegNetY_400MF | 94.29% |
| MobileNetV2 | 94.43% |
| ResNeXt29(32x4d) | 94.73% |
| ResNeXt29(2x64d) | 94.82% |
| SimpleDLA | 94.89% |
| DenseNet121 | 95.04% |
| PreActResNet18 | 95.11% |
| DPN92 | 95.16% |
| DLA | 95.47% |