Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Last update: Feb 23, 2022

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Recently, researchers proposed pruning deep neural network weights (DNNs) using an $N:M$ fine-grained block sparsity mask. In this mask, for each block of M weights, we have at least N zeros. In contrast to unstructured sparsity, N:M fine-grained block sparsity allows acceleration in actual modern hardware. Previously suggested solutions enabled DNN acceleration at the inference phase. To also allow such acceleration in the training phase, we suggest a novel transposable-fine-grained sparsity mask where the same mask can be used for both forward and backward passes. Our transposable mask ensures that both the weight matrix and its transpose follow the same sparsity pattern; thus the matrix multiplication required for passing the error backward can also be accelerated. We discuss the transposable constraint and devise a new measure for mask constraints, called mask-diversity (MD), which correlates with their expected accuracy. Lastly, we formulate the problem of finding the optimal transposable mask as a minimum-cost-flow problem and suggest a fast linear approximation that can be used when the masks dynamically change while training. Our experiments suggest 2x speed-up with no accuracy degradation over vision and language models. A reference implementation is available in the supplementary material.

Reproducing the results

This repository is partially based on convNet.pytorch repo. please ensure that you are using pytorch 1.7+. Reproducing AdaPrune results

cd AdaPrune
sh scripts/adaprune_dense_bnt.sh
sh scripts/adaprune_sparse.sh

Reproducing static NM-transposable starting from dense pre-trained model:

cd static_TNM
sh scripts/prune_pretrained_R50.sh

Reproducing dynamic NM-transposable from scratch:

cd dynamic_TNM
sh scripts/clone_and_copy.sh
sh scripts/run_R18.sh
sh scripts/run_R50.sh

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Related tags

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Reproducing the results

Owner

itay hubara

Fully featured implementation of Routing Transformer

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision

A Fast Command Analyser based on Dict and Pydantic

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

NLP topic mdel LDA - Gathered from New York Times website

Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 B) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed

Ecommerce product title recognition package

Code for the paper "Flexible Generation of Natural Language Deductions"

Word Bot for JKLM Bomb Party

Training RNNs as Fast as CNNs

A python wrapper around the ZPar parser for English.

Big Bird: Transformers for Longer Sequences

Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks

This code is the implementation of Text Emotion Recognition (TER) with linguistic features

🌐 Translation microservice powered by AI

Learning Spatio-Temporal Transformer for Visual Tracking

Enterprise Scale NLP with Hugging Face & SageMaker Workshop series

Community and sentiment analysis based on tweets