Anderson Accelerated Deep Learning (AADL)

AADL is a Python package that implements the Anderson acceleration to speed-up the training of deep learning (DL) models using the PyTorch library.
AA is an extrapolation technique that can accelerate fixed-point iterations such those arising from the iterative training of DL models. However, large volume of data are typically processed in sequential random batches which introduces stochastic oscillations in the fixed-point iteration that hinders AA acceleration. AADL implements a moving average that reduces the oscillations and results in a smoother sequence of gradient descent updates which enables the use of AA. AADL uses a criterion to automatically decide if the moving average is needed by monitoring if the relative standard deviation between consecutive stochastic gradient updates exceeds a tolerance defined by the user.

Requirements

Python 3.5 or greater
PyTorch (any version works)

Installation

AADL comes with a setuptools install script:

python3 setup.py install

Usage

import torch
import torch.nn
import torch.optim
import AADL

# Creation of the DL model (neural network)
class model(torch.nn.Module):
	...

# Definition of the stochastic optimizer used to train the model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9, nesterov = True)

# Parameters for Anderson acceleration
relaxation = 0.5
wait_iterations = 0
history_depth = 10
store_each_nth = 10
frequency = store_each_nth
reg_acc = 0.0
safeguard = True
average = True

# Over-writing of the torch.optim.step() method 
AADL.accelerate(optimizer_anderson, "anderson", relaxation, wait_iterations, history_depth, store_each_nth, frequency, reg_acc, average)

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

BSD-3-Clause

Citations

"AADL: Anderson Accelerated Deep Learning", Copyright ID#: 81927550 https://doi.org/10.11578/dc.20210723.1

Anderson Acceleration for Deep Learning

Related tags

Overview

Anderson Accelerated Deep Learning (AADL)

Requirements

Installation

Usage

Contributing

License

Citations

Owner

Oak Ridge National Laboratory

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

NeuralForecast is a Python library for time series forecasting with deep learning models

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

Pytorch implementation of ProjectedGAN

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Graph Neural Networks with Keras and Tensorflow 2.

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

Riemann Noise Injection With PyTorch

PyTorch Autoencoders - Implementing a Variational Autoencoder (VAE) Series in Pytorch.

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Low-dose Digital Mammography with Deep Learning

Open-Ended Commonsense Reasoning (NAACL 2021)

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

Spatial Intention Maps for Multi-Agent Mobile Manipulation (ICRA 2021)