[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

Last update: Dec 30, 2022

Overview

Code for the paper "BNN - BN = ? Training Binary Neural Networks without Batch Normalization" [CVPR BiVision Workshop 2021].

Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang.

Batch normalization (BN) is a key facilitator of state-of-the-art binary neural networks (BNNs) and is widely considered essential to them. However, the BN layer is costly to compute and is typically implemented with non-binary parameters, which hinders efficient BNN training. It also introduces an undesirable dependence between the samples within each batch.

Inspired by recent advances in Batch Normalization Free (BN-Free) training, we extend that framework to BNNs and demonstrate, for the first time, that BN can be removed completely from both BNN training and inference. By plugging in and customizing techniques including adaptive gradient clipping, scaled weight standardization, and a specialized bottleneck block, a BN-free BNN maintains accuracy competitive with its BN-based counterpart. Experimental results can be found in our paper.
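As a rough illustration of two of the techniques named above, the sketch below shows adaptive gradient clipping and scaled weight standardization in the form commonly described for normalizer-free networks. It is not the code in this repository: it works on one output unit (one row of a weight matrix) at a time, uses plain Python lists so that it runs without PyTorch, and the function names and the default values of `clip`, `gain`, and `eps` are assumptions chosen for the example.

```python
import math


def adaptive_clip_row(weight_row, grad_row, clip=0.02, eps=1e-3):
    """Unit-wise adaptive gradient clipping for one output unit.

    Rescales the gradient so that ||g|| / max(||w||, eps) <= clip.
    `clip` and `eps` are illustrative defaults, not values from the paper.
    """
    w_norm = max(math.sqrt(sum(w * w for w in weight_row)), eps)
    g_norm = math.sqrt(sum(g * g for g in grad_row))
    max_norm = clip * w_norm
    if g_norm > max_norm:
        scale = max_norm / g_norm
        return [g * scale for g in grad_row]
    return list(grad_row)


def scaled_weight_standardize(weight_row, gain=1.0, eps=1e-4):
    """Scaled weight standardization for one output unit.

    Computes gain * (w - mean) / sqrt(max(var * fan_in, eps)), where
    fan_in is the number of inputs feeding the unit and var is the
    population variance of its weights.
    """
    fan_in = len(weight_row)
    mean = sum(weight_row) / fan_in
    var = sum((w - mean) ** 2 for w in weight_row) / fan_in
    scale = gain / math.sqrt(max(var * fan_in, eps))
    return [(w - mean) * scale for w in weight_row]


if __name__ == "__main__":
    # A gradient that is large relative to its weights gets scaled down.
    print(adaptive_clip_row([3.0, 4.0], [30.0, 40.0], clip=0.1))
    # The standardized weights have zero mean and variance 1 / fan_in.
    print(scaled_weight_standardize([1.0, 2.0, 3.0, 4.0]))
```

In a real training loop the same arithmetic would be applied to tensors, with the clipping step run on the gradients just before the optimizer update and the standardization applied to the latent weights before they are used in the forward pass.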

BN-Free Binary Neural Networks

Reproduce

Environment

pytorch == 1.5.0
torchvision == 0.6.0
timm == 0.4.5
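
Before launching a long training run, it can help to confirm that the installed packages match the pinned versions above. The following is a minimal sketch using only the standard library (Python 3.8+); `check_environment` is a hypothetical helper, not part of this repository. It assumes the packages were installed from PyPI, where the PyTorch distribution is named `torch`.

```python
from importlib import metadata

# Pinned versions from the Environment list.
# The PyPI distribution for PyTorch is named "torch".
REQUIRED = {"torch": "1.5.0", "torchvision": "0.6.0", "timm": "0.4.5"}


def check_environment(required=REQUIRED):
    """Return {package: (required_version, installed_version_or_None, matches)}."""
    report = {}
    for name, wanted in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # A local build tag such as "1.5.0+cu101" still counts as a match.
        ok = installed is not None and installed.split("+")[0] == wanted
        report[name] = (wanted, installed, ok)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, ok) in check_environment().items():
        status = "ok" if ok else "MISMATCH"
        print(f"{name}: required {wanted}, installed {installed} [{status}]")
```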

Training on ImageNet

./script/imagenet_reactnet_A_bf.sh (BN-Free ReActNet-A)
./script/imagenet_reactnet_A_bn.sh (ReActNet-A with BN)
./script/imagenet_reactnet_A_none.sh (ReActNet-A without BN)

Citation

@article{gaur2020training,
  title={Training Deep Neural Networks Without Batch Normalization},
  author={Gaur, Divya and Folz, Joachim and Dengel, Andreas},
  journal={arXiv preprint arXiv:2008.07970},
  year={2020}
}

Acknowledgement

https://github.com/liuzechun/ReActNet

https://github.com/liuzechun/Bi-Real-net

https://github.com/vballoli/nfnets-pytorch

https://github.com/deepmind/deepmind-research/tree/master/nfnets

Owner

VITA
