Byzantine-robust decentralized learning via self-centered clipping

In this paper, we study the challenging task of Byzantine-robust decentralized training on arbitrary communication graphs. Unlike federated learning, where workers communicate through a server, workers in the decentralized environment can only talk to their neighbors, making it harder to reach consensus. We identify a novel dissensus attack in which a few malicious nodes can take advantage of information bottlenecks in the topology to poison the collaboration. To address these issues, we propose a Self-Centered Clipping (SCClip) algorithm for Byzantine-robust consensus and optimization, which is the first to provably converge to a $O(\delta_{\max}\zeta^2/\gamma^2)$ neighborhood of the stationary point for non-convex objectives under standard assumptions. Finally, we demonstrate the encouraging empirical performance of SCClip under a large number of attacks.
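To make the idea of self-centered clipping concrete, here is a minimal sketch of one aggregation step for a single worker. It assumes the update takes the form x_i + sum_j W_ij * clip(x_j - x_i, tau), where clip rescales a vector so that its norm is at most tau. The function names, the list-based model representation, and the fixed clipping radius are illustrative assumptions, not the code in codes/.

```python
import math


def clip(vec, tau):
    """Rescale vec so that its Euclidean norm is at most tau."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm <= tau or norm == 0.0:
        return list(vec)
    scale = tau / norm
    return [scale * v for v in vec]


def self_centered_clip_step(own, neighbors, weights, tau):
    """One aggregation step, centered on the worker's own model.

    own:       this worker's model (list of floats)
    neighbors: models received from neighbors (list of lists)
    weights:   mixing weight of each neighbor (assumed to sum to at most 1)
    tau:       clipping radius (hypothetical fixed value)
    """
    new = list(own)
    for x_j, w in zip(neighbors, weights):
        # Clip the difference to the worker's own model, not the raw message.
        diff = [a - b for a, b in zip(x_j, own)]
        clipped = clip(diff, tau)
        new = [n + w * c for n, c in zip(new, clipped)]
    return new


# One nearby neighbor and one far-away (possibly Byzantine) neighbor.
own = [0.0, 0.0]
neighbors = [[1.0, 0.0], [100.0, 0.0]]
weights = [0.25, 0.25]
result = self_centered_clip_step(own, neighbors, weights, tau=2.0)
print(result)
```

In this toy example the far-away neighbor can move the worker by at most weight times tau, whereas plain weighted averaging would let it pull the first coordinate to 25.25.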

Code organization
Reproduction
License
Reference

Code organization

The structure of the repository is as follows:

codes/
- Source code.
outputs/
- Store the output of the launcher scripts.
consensus.ipynb: Study the error of aggregators to the average consensus under dissensus attack.
- This notebook generates Fig. 3 in the main text and Fig. 8 in the appendix.
dumbbell.py: Study how topology and heterogeneity influence the aggregators.
dumbbell_improvement.py: Study how to help aggregators cope with the influence of topology and heterogeneity.
dumbbell.ipynb: Plot the results of dumbbell.py and dumbbell_improvement.py.
- Generate Fig. 4 in the main text.
optimization_delta.py: Fix p and zeta^2, and vary delta of the dissensus attack for the SCClip aggregator.
- Generate Fig. 5 in the main text.
honest_majority.py: Study the influence of the honest majority.
- Generate Fig. 6 in the main text.
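As an illustration of the kind of topology the dumbbell scripts refer to, the sketch below builds a dumbbell graph and a mixing matrix for it. It assumes a dumbbell means two cliques joined by a single bridge edge, and it uses Metropolis-Hastings weights as one common choice of mixing matrix. Both are assumptions for illustration; the function names are hypothetical and are not taken from dumbbell.py.

```python
def dumbbell_edges(clique_size):
    """Return (n, edges) for two cliques of clique_size nodes joined by one edge."""
    n = 2 * clique_size
    edges = set()
    for offset in (0, clique_size):
        for i in range(clique_size):
            for j in range(i + 1, clique_size):
                edges.add((offset + i, offset + j))
    # The bridge is the information bottleneck between the two cliques.
    edges.add((clique_size - 1, clique_size))
    return n, sorted(edges)


def metropolis_weights(n, edges):
    """Symmetric mixing matrix whose rows sum to 1 (Metropolis-Hastings rule)."""
    degree = [0] * n
    for i, j in edges:
        degree[i] += 1
        degree[j] += 1
    W = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        w = 1.0 / (1 + max(degree[i], degree[j]))
        W[i][j] = w
        W[j][i] = w
    for i in range(n):
        # The self-weight absorbs whatever mass the neighbors do not take.
        W[i][i] = 1.0 - sum(W[i])
    return W


n, edges = dumbbell_edges(3)
W = metropolis_weights(n, edges)
for row in W:
    print([round(w, 3) for w in row])
```

Only the two bridge endpoints have a nonzero weight across the cliques, so all information exchanged between the two halves passes through that single edge.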

Reproduction

To reproduce the results in the paper, follow these steps:

Add codes/ to the environment variable PYTHONPATH.
Install the dependencies: pip install -r requirements.txt
Run bash run.sh and select an option from 2 to 9 to run the corresponding experiment.
The output will be saved to the corresponding folders under outputs/.

Note that if the GPU memory is small (e.g. less than 16 GB), running the previous commands may raise an out-of-memory error. In this case, one can decrease the level of parallelism in the script by changing the order of the loops and reducing the number of parallel processes.

License

This repo is covered under The MIT License.

Reference

TODO

Owner

EPFL Machine Learning and Optimization Laboratory