Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Last update: Nov 16, 2021

Related tags

Deep Learning marl-design

Overview

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Official implementation of:

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Shriram Chennakesavalu and Grant M. Rotskoff

https://arxiv.org/abs/2111.06875

Abstract: Experimental advances enabling high-resolution external control create new opportunities to produce materials with exotic properties. In this work, we investigate how a multi-agent reinforcement learning approach can be used to design external control protocols for self-assembly. We find that a fully decentralized approach performs remarkably well even with a "coarse" level of external control. More importantly, we see that a partially decentralized approach, where we include information about the local environment allows us to better control our system towards some target distribution. We explain this by analyzing our approach as a partially-observed Markov decision process. With a partially decentralized approach, the agent is able to act more presciently, both by preventing the formation of undesirable structures and by better stabilizing target structures as compared to a fully decentralized approach.

Installing prerequisites (using conda)

conda env create -f environment.yml -n marldesign
conda activate marldesign

Possible --centralize_approach values are ("plaquette", "all", "grid_n"), where 1 < n < region_num/2

Sample training commands

python train.py --active --centralize_states --centralize_approach plaquette
python train.py --active --centralize_rewards --centralize_approach all
python train.py --centralize_rewards --centralize_states --centralize_approach grid_1

Sample testing commands

python test.py --active --num_samples 10  --centralize_states --centralize_approach plaquette
python test.py --active --num_samples 10 --centralize_rewards --centralize_approach grid_1
python test.py --centralize_rewards --num_samples 10 --centralize_states --centralize_approach grid_2

For a more theoretical description of the systems described here, please visit https://github.com/rotskoff-group/dissipative-design

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Related tags

Overview

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Installing prerequisites (using conda)

Sample training commands

Sample testing commands

Owner

Flower classification model that classifies flowers in 10 classes made using transfer learning (~85% accuracy).

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

PyTorch implementation of Tacotron speech synthesis model.

Sharpened cosine similarity torch - A Sharpened Cosine Similarity layer for PyTorch

A fast MoE impl for PyTorch

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

This is an open source python repository for various python tests

Automatic Image Background Subtraction

[AI6122] Text Data Management & Processing

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Domain Generalization for Mammography Detection via Multi-style and Multi-view Contrastive Learning

Contains code for Deep Kernelized Dense Geometric Matching

A Tensorfflow implementation of Attend, Infer, Repeat

Kaggleship: Kaggle Notebooks

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

A simple image/video to Desmos graph converter run locally

Pytorch implementation of four neural network based domain adaptation techniques: DeepCORAL, DDC, CDAN and CDAN+E. Evaluated on benchmark dataset Office31.

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

Code for "My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack" paper

This repository contains the source code for the paper Tutorial on amortized optimization for learning to optimize over continuous domains by Brandon Amos