Experiments for distributed optimization algorithms

Last update: Dec 04, 2022

Overview

Network-Distributed Algorithm Experiments

This repository contains a set of optimization algorithms and objective functions, and all code needed to reproduce experiments in:

"DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization" [PDF]. (code is in this file [link])
"Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction" [PDF]. (code is in the previous version of this repo [link])

Due to the random data generation procedure, resulting graphs may be slightly different from those appeared in the paper, but conclusions remain the same.

If you find this code useful, please cite our papers:

@article{li2021destress,
  title={DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization},
  author={Li, Boyue and Li, Zhize and Chi, Yuejie},
  journal={arXiv preprint arXiv:2110.01165},
  year={2021}
}

@article{li2020communication,
  title={Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction},
  author={Li, Boyue and Cen, Shicong and Chen, Yuxin and Chi, Yuejie},
  journal={Journal of Machine Learning Research},
  volume={21},
  pages={1--51},
  year={2020}
}

Implemented objective functions

The gradient implementations of all objective functions are checked numerically.

Linear regression

Linear regression with random generated data. The objective function is $f(w) = \frac{1}{N} \sum_i (y_i - x_i^\top w)^2$

Logistic regression

Logistic regression with $l$-2 or nonconvex regularization with random generated data or the Gisette dataset or datasets from libsvmtools. The objective function is $$ f(w) = - \frac{1}{N} * \Big(\sum_i y_i \log \frac{1}{1 + exp(w^T x_i)} + (1 - y_i) \log \frac{exp(w^T x_i)}{1 + exp(w^T x_i)} \Big) + \frac{\lambda}{2} | w |_2^2 + \alpha \sum_j \frac{w_j^2}{1 + w_j^2} $$

One-hidden-layer fully-connected neural netowrk

One-hidden-layer fully-connected neural network with softmax loss on the MNIST dataset.

Implemented optimization algorithms

Centralized optimization algorithms

Gradient descent
Stochastic gradient descent
Nesterov's accelerated gradient descent
SVRG
SARAH

Distributed optimization algorithms (i.e. with parameter server)

ADMM
DANE

Decentralized optimization algorithms

Decentralized gradient descent
Decentralized stochastic gradient descent
Decentralized gradient descent with gradient tracking
EXTRA
NIDS
Network-DANE/SARAH/SVRG
GT-SARAH
DESTRESS

Experiments for distributed optimization algorithms

Related tags

Overview

Network-Distributed Algorithm Experiments

Implemented objective functions

Linear regression

Logistic regression

One-hidden-layer fully-connected neural netowrk

Implemented optimization algorithms

Centralized optimization algorithms

Distributed optimization algorithms (i.e. with parameter server)

Decentralized optimization algorithms

Owner

Boyue Li

Repository for the electrical and ICT benchmark model developed in the ERIGrid 2.0 project.

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Tello Drone Trajectory Tracking

Instance Semantic Segmentation List

Setup freqtrade/freqUI on Heroku

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

External Attention Network

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Hough Transform and Hough Line Transform Using OpenCV

Neural Message Passing for Computer Vision

Multi-scale discriminator feature-wise loss function

Code release to accompany paper "Geometry-Aware Gradient Algorithms for Neural Architecture Search."

Autoencoder - Reducing the Dimensionality of Data with Neural Network

A fast and easy to use, moddable, Python based Minecraft server!

It's a implement of this paper：Relation extraction via Multi-Level attention CNNs