Evolving Neural Networks in JAX

This repository holds code displaying techniques for applying evolutionary network training strategies in JAX. Each script trains a network to solve the same problem: given a sequence of regularly-spaced values on a sine wave, predict the next value. The problem is trivial - the interesting part is intended to be the way in which this is accomplished, by updating network parameters directly and without gradient calculations, in parallel across devices. A lengthy tutorial is included, explaining the ideas and rationale. Much of the code is duplicated between scripts so that readers can run them individually and, if they like, view the differences between files to see what changes in each section.

The evolutionary ideas present here are mainly taken from OpenAI's blog post describing their efforts at scaling evolution strategies (and the associated code.)

tutorial.md

A longform tutorial that explains why I think evolutionary optimization strategies are interesting and some of the JAX techniques that I use to implement them. Individual bits of the code in each of the script files are discussed here.

simple.py

In this file, a very basic evolutionary strategy is implemented, without many optimizations. You can get a grasp here on how some fundamental JAX methods like scan and vmap are used to execute our training routine.

advanced.py

Here, some optimizations that OpenAI made in their code are added to our training routine. The various optimizations are discussed in depth in the article.

parallel.py

In this file, we prepare to scale the network to more than one device and to greater sizes. Vectorization becomes parallelization, and the code is sliced up so that we can calculate our network updates on a single device.

Evolving neural network parameters in JAX.

Related tags

Overview

Evolving Neural Networks in JAX

tutorial.md

simple.py

advanced.py

parallel.py

Owner

Trevor Thackston

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

This repository contains a CBIR system that uses swin transformer to extract image's feature.

🕹️ Official Implementation of Conditional Motion In-betweening (CMIB) 🏃

Pytorch Implementation for Dilated Continuous Random Field

Rax is a Learning-to-Rank library written in JAX

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

PyTorch implementation of an end-to-end Handwritten Text Recognition (HTR) system based on attention encoder-decoder networks

DISTIL: Deep dIverSified inTeractIve Learning.

Code of Puregaze: Purifying gaze feature for generalizable gaze estimation, AAAI 2022.

StarGAN v2 - Official PyTorch Implementation (CVPR 2020)

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Simple and ready-to-use tutorials for TensorFlow

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

The materials used in the SaxonJS tutorial presented at Declarative Amsterdam, 2021

Exporter for Storage Area Network (SAN)

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

ImageNet Adversarial Image Evaluation

Implementation of Restricted Boltzmann Machine (RBM) and its variants in Tensorflow