Efficient Lottery Ticket Finding: Less Data is More

Last update: Sep 04, 2022

Code for the paper "Efficient Lottery Ticket Finding: Less Data is More" [ICML 2021].

Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang

Overview

The lottery ticket hypothesis (LTH) reveals the existence of winning tickets (sparse but critical subnetworks) in dense networks that can be trained in isolation from random initialization to match the accuracy of the dense network. However, finding winning tickets requires burdensome computation in the train-prune-retrain process, especially on large-scale datasets (e.g., ImageNet), which restricts their practical benefits. This paper explores a new perspective on finding lottery tickets more efficiently: doing so with only a specially selected subset of the data, called the Pruning-Aware Critical set (PrAC set), rather than the full training set. The concept of the PrAC set was inspired by the recent observation that deep networks have samples that are either hard to memorize during training or easy to forget during pruning. A PrAC set is thus hypothesized to capture the most challenging and informative examples for the dense model. We observe that a high-quality winning ticket can be found by training and pruning the dense network on the very compact PrAC set, which can substantially reduce the training iterations needed for the ticket-finding process.
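As a rough illustration of the idea above, the sketch below collects samples that were never memorized, were forgotten during training, or were classified correctly before pruning but not after. It is not the repository's implementation: the function names, the `max_forgets` parameter, and the toy records are all hypothetical, and the exact selection rule used in the paper may differ.

```python
from typing import Dict, List, Set


def count_forgetting_events(correct_history: List[bool]) -> int:
    """Count transitions from correctly classified to misclassified across epochs."""
    return sum(
        1
        for prev, curr in zip(correct_history, correct_history[1:])
        if prev and not curr
    )


def select_critical_set(
    train_history: Dict[int, List[bool]],
    correct_after_pruning: Dict[int, bool],
    max_forgets: int = 0,  # hypothetical knob, not a flag of this repository
) -> Set[int]:
    """Return ids of samples that were hard to memorize or were forgotten by pruning."""
    critical = set()
    for sample_id, history in train_history.items():
        never_learned = not any(history)
        forgotten_in_training = count_forgetting_events(history) > max_forgets
        forgotten_by_pruning = history[-1] and not correct_after_pruning[sample_id]
        if never_learned or forgotten_in_training or forgotten_by_pruning:
            critical.add(sample_id)
    return critical


# Toy per-epoch correctness records for four samples (hypothetical data).
train_history = {
    0: [True, True, True],     # learned early and never forgotten
    1: [False, True, False],   # learned, then forgotten during training
    2: [False, False, False],  # never memorized
    3: [False, True, True],    # learned, but misclassified after pruning
}
correct_after_pruning = {0: True, 1: False, 2: False, 3: False}

prac_like_set = select_critical_set(train_history, correct_after_pruning)
print(sorted(prac_like_set))
```

In this toy run only sample 0 is left out, since it is the only one that is both stably learned and still classified correctly after pruning.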

Prerequisites

PyTorch >= 1.4

torchvision

advertorch

Usage

Vanilla Lottery Tickets

python -u main_imp.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--batch_size 128 \
	--lr 0.1 \
	--pruning_times 16 \
	--prune_type rewind_lt \
	--rewind_epoch 2 \
	--save_dir lt_cifar10_res20s
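For readers unfamiliar with the procedure, here is a minimal sketch of one round of iterative magnitude pruning with weight rewinding, which is what the `--prune_type rewind_lt` and `--rewind_epoch` flags appear to select. This is an assumption about the flags, not a description of `main_imp.py`: the helper names, the toy weights, and the 20% per-round pruning fraction are illustrative only.

```python
from typing import Dict, List


def global_magnitude_mask(
    weights: Dict[str, List[float]],
    mask: Dict[str, List[int]],
    prune_fraction: float,
) -> Dict[str, List[int]]:
    """Prune the smallest-magnitude fraction of the weights that are still unpruned."""
    alive = sorted(
        (abs(w), name, i)
        for name, values in weights.items()
        for i, w in enumerate(values)
        if mask[name][i] == 1
    )
    num_to_prune = int(len(alive) * prune_fraction)
    new_mask = {name: list(m) for name, m in mask.items()}
    for _, name, i in alive[:num_to_prune]:
        new_mask[name][i] = 0
    return new_mask


def rewind(
    rewind_weights: Dict[str, List[float]], mask: Dict[str, List[int]]
) -> Dict[str, List[float]]:
    """Reset surviving weights to their saved early-training values; zero the rest."""
    return {
        name: [w * m for w, m in zip(values, mask[name])]
        for name, values in rewind_weights.items()
    }


# Hypothetical toy "network": two layers with five weights each.
rewind_weights = {"conv": [0.5, -0.1, 0.3, 0.05, -0.7], "fc": [0.2, -0.9, 0.01, 0.4, -0.6]}
trained_weights = {"conv": [0.8, -0.02, 0.4, 0.1, -0.9], "fc": [0.3, -1.1, 0.05, 0.6, -0.7]}
mask = {name: [1] * len(values) for name, values in trained_weights.items()}

# One round: prune by trained magnitude, then restart from the rewind point.
mask = global_magnitude_mask(trained_weights, mask, prune_fraction=0.2)
next_round_init = rewind(rewind_weights, mask)
print(mask)
print(next_round_init)
```

Repeating this round (train, prune, rewind) would correspond to what `--pruning_times 16` presumably controls.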

PrAC Lottery Tickets

python -u main_PrAC_imp.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--split_file npy_files/cifar10-train-val.npy \
	--batch_size 128 \
	--lr 0.1 \
	--pruning_times 16 \
	--eb_eps 0.08 \
	--prune_type rewind_lt \
	--rewind_epoch 2 \
	--threshold 0 \
	--save_dir PrAC_lt_cifar10_res20s

Train subnetworks

# 1checkpoint.pth.tar: sparsity=20%
python -u main_train.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--batch_size 128 \
	--lr 0.1 \
	--init_dir PrAC_lt_cifar10_res20s/1checkpoint.pth.tar \
	--mask_dir PrAC_lt_cifar10_res20s/1checkpoint.pth.tar \
	--save_dir retrain_PrAC_lt_cifar10_res20s/1
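To pick a checkpoint for a target sparsity, the following sketch may help. It assumes each pruning round removes 20% of the remaining weights, which is consistent with the 20% sparsity noted for `1checkpoint.pth.tar` but is not confirmed by this README for later rounds; the function name is hypothetical.

```python
def sparsity_after_rounds(rounds: int, prune_fraction: float = 0.2) -> float:
    """Fraction of weights removed after `rounds` pruning rounds.

    Assumes every round prunes `prune_fraction` of the weights still remaining.
    """
    remaining = (1.0 - prune_fraction) ** rounds
    return 1.0 - remaining


for checkpoint_index in (1, 2, 4, 8, 16):
    sparsity = sparsity_after_rounds(checkpoint_index)
    print(f"{checkpoint_index}checkpoint.pth.tar -> sparsity {sparsity:.1%}")
```

Under this assumption, sparsity grows geometrically rather than linearly, so later checkpoints differ by progressively smaller amounts.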
