Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Last update: Jan 06, 2023

Overview

RIIT

Our open-source code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-AgentReinforcement Learning. We implement and standardize the hyperparameters of numerous QMIX variant algorithms that achieve SOTA.

Python MARL framework

PyMARL is WhiRL's framework for deep multi-agent reinforcement learning and includes implementations of the following algorithms:

Value-based Methods:

Actor Critic Methods:

PyMARL is written in PyTorch and uses SMAC as its environment.

Installation instructions

Install Python packages

# require Anaconda 3 or Miniconda 3
bash install_dependecies.sh

Set up StarCraft II and SMAC:

bash install_sc2.sh

This will download SC2 into the 3rdparty folder and copy the maps necessary to run over.

Run an experiment

# For SMAC
python3 src/main.py --config=qmix --env-config=sc2 with env_args.map_name=corridor

# For Cooperative Predator-Prey
python3 src/main.py --config=qmix_prey --env-config=stag_hunt with env_args.map_name=stag_hunt

The config files act as defaults for an algorithm or environment.

They are all located in src/config. --config refers to the config files in src/config/algs --env-config refers to the config files in src/config/envs

Run parallel experiments:

# bash run.sh config_name map_name_list (threads_num arg_list gpu_list experinments_num)
bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5

xxx_list is separated by ,.

All results will be stored in the Results folder and named with map_name.

Force all trainning processes to exit

# all python and game processes of current user will quit.
bash clean.sh

Some test results on Super Hard scenarios

Cite

@article{hu2021riit,
      title={RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning}, 
      author={Jian Hu and Siyang Jiang and Seth Austin Harding and Haibin Wu and Shih-wei Liao},
      year={2021},
      eprint={2102.03479},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Related tags

Overview

RIIT

Python MARL framework

Installation instructions

Run an experiment

Run parallel experiments:

Force all trainning processes to exit

Some test results on Super Hard scenarios

Cite

Owner

TrackTech: Real-time tracking of subjects and objects on multiple cameras

This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Multiple paper open-source codes of the Microsoft Research Asia DKI group

The comma.ai Calibration Challenge!

NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021

Clustering with variational Bayes and population Monte Carlo

Politecnico of Turin Thesis: "Implementation and Evaluation of an Educational Chatbot based on NLP Techniques"

a curated list of docker-compose files prepared for testing data engineering tools, databases and open source libraries.

Preparation material for Dropbox interviews

Band-Adaptive Spectral-Spatial Feature Learning Neural Network for Hyperspectral Image Classification

Weakly-supervised object detection.

A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions.

Using a Seq2Seq RNN architecture via TensorFlow to predict future Bitcoin prices

Code implementation of Data Efficient Stagewise Knowledge Distillation paper.

competitions-v2

A naive ROS interface for visualDet3D.

The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Hooks for VCOCO