[ICLR'21] FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

Overview

FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

This is the PyTorch implementation of our paper FedBN: Federated Learning on Non-IID Features via Local Batch Normalization by Xiaoxiao Li, Meirui Jiang, Xiaofei Zhang, Michael Kamp, and Qi Dou.

Abstract

The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data, hence improving data privacy. In most cases, the assumption of independent and identically distributed samples across local clients does not hold for federated learning setups. Under this setting, neural network training performance may vary significantly according to the data distribution and even hurt training convergence. Most of the previous work has focused on a difference in the distribution of labels. Unlike those settings, we address an important problem of FL in which local clients may store examples with different marginal or conditional feature distributions compared to other nodes, which we denote as feature shift non-iid (e.g., different scanners/sensors in medical imaging, or different scenery distributions in autonomous driving, such as highway vs. city). In this work, we propose an effective method that uses local batch normalization to alleviate the feature shift before averaging models. The resulting scheme, called FedBN, outperforms both classical FedAvg and the state-of-the-art method for non-iid data (FedProx) in our extensive experiments. These empirical results are supported by a convergence analysis showing, in a simplified setting, that FedBN has a faster convergence rate in expectation than FedAvg.
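
FedBN's core mechanism is that batch-normalization layers are excluded from server-side model averaging, so each client keeps BN parameters and running statistics adapted to its own feature distribution. Below is a minimal sketch of that aggregation step; it is not the repository's exact code, and the assumption that BN parameter names contain 'bn' is purely illustrative.

import torch

def fedbn_average(client_states, client_weights):
    """Weighted-average every state-dict tensor whose name does not look like a BN entry.
    BN weights, biases, and running statistics are skipped, so each client
    keeps its own local copy (the core idea of FedBN)."""
    averaged = {}
    for key in client_states[0]:
        if 'bn' in key:
            continue  # batch-norm parameters and running stats stay local
        averaged[key] = torch.stack(
            [w * state[key].float() for state, w in zip(client_states, client_weights)]
        ).sum(dim=0)
    return averaged

# Each client then overwrites only the shared (non-BN) entries of its state dict, e.g.:
#   local_state = model.state_dict()
#   local_state.update(fedbn_average(states, weights))
#   model.load_state_dict(local_state)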

Usage

Setup

pip

See the requirements.txt for environment configuration.

pip install -r requirements.txt

conda

We recommend using conda to quickly set up the environment. Please use the following commands.

conda env create -f environment.yaml
conda activate fedbn

Dataset & Pretrained Model

Benchmark (Digits)

  • Please download our pre-processed datasets here, put them under the data/ directory, and run the following commands:
    cd ./data
    unzip digit_dataset.zip
  • Please download our pretrained model here, put it under the snapshots/ directory, and run the following commands:
    cd ./snapshots
    unzip digit_model.zip

office-caltech10

  • Please download our pre-processed datasets here, put them under the data/ directory, and run the following commands:
    cd ./data
    unzip office_caltech_10_dataset.zip
  • Please download our pretrained model here, put it under the snapshots/ directory, and run the following commands:
    cd ./snapshots
    unzip office_caltech_10_model.zip

DomainNet

  • Please first download our data split here, put it under the data/ directory, and run the following commands:
    cd ./data
    unzip domainnet_dataset.zip
  • Then download the datasets (Clipart, Infograph, Painting, Quickdraw, Real, Sketch), put them under the data/DomainNet directory, and unzip them:
    cd ./data/DomainNet
    unzip [filename].zip
  • Please download our pretrained model here, put it under the snapshots/ directory, and run the following commands:
    cd ./snapshots
    unzip domainnet_model.zip

Train

Federated Learning

Please use the following commands to train a model with a federated learning strategy.

  • --mode specifies the federated learning strategy; options: fedavg | fedprox | fedbn (a sketch of how these modes differ follows the commands below)
cd federated
# benchmark experiment
python fed_digits.py --mode fedbn

# office-caltech-10 experiment
python fed_office.py --mode fedbn

# DomainNet experiment
python fed_domainnet.py --mode fedbn
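
For orientation, here is a hedged sketch of how the three modes typically differ in a client's local update: fedavg and fedbn use the plain local loss and differ only in whether the server later averages BN layers (see the sketch in the Overview), while fedprox adds a proximal term that pulls local weights toward the global model. The function name, the mu coefficient, and global_params below are illustrative assumptions, not identifiers from this repository.

import torch

def local_step(model, global_params, batch, loss_fn, optimizer, mode, mu=1e-2):
    """One hypothetical local training step under the different --mode options."""
    x, y = batch
    loss = loss_fn(model(x), y)
    if mode == 'fedprox':
        # FedProx penalizes drift from the global model: (mu / 2) * ||w - w_global||^2
        prox = sum(((p - g.detach()) ** 2).sum()
                   for p, g in zip(model.parameters(), global_params))
        loss = loss + 0.5 * mu * prox
    # fedavg and fedbn use the plain local loss; fedbn differs only in that the
    # server skips batch-norm layers when aggregating the updated models
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()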

SingleSet

Please use the following commands to train a model on a single dataset.

  • --data specifies the single dataset
cd singleset 
# benchmark experiment, --data option: svhn | usps | synth | mnistm | mnist
python single_digits.py --data svhn

# office-caltech-10 experiment, --data option: amazon | caltech | dslr | webcam
python single_office.py --data amazon

# DomainNet experiment, --data option: clipart | infograph | painting | quickdraw | real | sketch
python single_domainnet.py --data clipart

Test

cd federated
# benchmark experiment
python fed_digits.py --mode fedbn --test

# office-caltech-10 experiment
python fed_office.py --mode fedbn --test

# DomainNet experiment
python fed_domainnet.py --mode fedbn --test

Citation

If you find the code and dataset useful, please cite our paper.

@inproceedings{
li2021fedbn,
title={Fed{BN}: Federated Learning on Non-{IID} Features via Local Batch Normalization},
author={Xiaoxiao Li and Meirui JIANG and Xiaofei Zhang and Michael Kamp and Qi Dou},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=6YEQUn0QICG}
}