A code implementation of AC-GC: Activation Compression with Guaranteed Convergence, in NeurIPS 2021.

Last update: Nov 01, 2022

Related tags

Deep Learning acgc

Overview

Code For AC-GC: Lossy Activation Compression with Guaranteed Convergence

This code is intended to be used as a supplemental material for submission to NeurIPS 2021.

DO NOT DISTRIBUTE

Setup

This code is tested on Ubuntu 20.04 with Python 3 and CUDA 10.1. Other cuda versions can be used by modifying the cupy version in requirements.txt, provided that CuDNN is installed.

# Set up environment
python3 -m venv
source venv/bin/activate
pip3 install -r requirements.txt

Training

Configurations are provided for CIFAR10/ResNet50 in the acgc/configs folder.

source venv/bin/activate
cd acgc
./configs/rn50_baseline.sh

To replicate GridQuantZ results from the paper, you additionally need to:

Run quantz with bitwidths of 2, 4, 6, 8, 10, 12, 14, and 16 bits, and run each 5 times
Select the result with the lowest bitwidth and average accuracy no less than the baseline - 0.1%

Evaluation

Evaluation with the CIFAR10 test dataset is run during training. The 'validation/main/accuracy' entry in the report.txt or log contains test accuracy throughout training.

Pre-trained Models

You can download pre-trained snapshots for each config from acgc/configs.

These snapshots can be run using

python3 train_cifar_act_error.py ... --resume <snapshot_file>

Results

We have added reports and logs for each configuration under acgc/results. The logs are associated with each snapshot, above.

A summarized output from these runs is:

Configuration	Best Test Acc	Average Bits	Epochs
rn50_baseline	95.16 %	N/A	300
rn50_quant_8bit	94.90 %	8.000	300
rn50_quantz_8bit	94.82 %	7.426	300
rn50_autoquant	94.73 %	7.305	300
rn50_autoquantz	94.91 %	6.694	300

Code Layout

Argument parsing and model initialization are handled in acgc/cifar.py and acgc/train_cifar_act_error.py

Modifications to the training loop are in acgc/common/compression/compressed_momentum_sgd.py.

The baseline fixpoint implementation is in acgc/common/compression/quant.py.

The AutoQuant implementation, and error bound calculation are in acgc/common/compression/autoquant.py.

Gradient and parameter estimation are performed in acgc/common/compression/grad_approx.py

A code implementation of AC-GC: Activation Compression with Guaranteed Convergence, in NeurIPS 2021.

Related tags

Overview

Code For AC-GC: Lossy Activation Compression with Guaranteed Convergence

Setup

Training

Evaluation

Pre-trained Models

Results

Code Layout

Owner

Dave Evans

Collapse by Conditioning: Training Class-conditional GANs with Limited Data

Dynamic Bottleneck for Robust Self-Supervised Exploration

Medical Insurance Cost Prediction using Machine earning

Lab course materials for IEMBA 8/9 course "Coding and Artificial Intelligence"

MATLAB codes of the book "Digital Image Processing Fourth Edition" converted to Python

Reinforcement Learning for the Blackjack

Decorator for PyMC3

Official PyTorch implementation of N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras (ICCV 2021)

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Creating a custom CNN hypertunned architeture for the Fashion MNIST dataset with Python, Keras and Tensorflow.

Simulated garment dataset for virtual try-on

Codes for "Template-free Prompt Tuning for Few-shot NER".

Pairwise Learning for Neural Link Prediction for OGB (PLNLP-OGB)

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Distributed Asynchronous Hyperparameter Optimization in Python

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Customer-Transaction-Analysis - This analysis is based on a synthesised transaction dataset containing 3 months worth of transactions for 100 hypothetical customers.

Image-to-image regression with uncertainty quantification in PyTorch

Simple Dynamic Batching Inference