Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Last update: Dec 25, 2022

Overview

MARL Tricks

Our code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning. We implemented the SOTA MARL algorithms and standardized their hyperparameters.

Python MARL framework

PyMARL is WhiRL's framework for deep multi-agent reinforcement learning and includes implementations of the following algorithms:

Value-based Methods:

Actor Critic Methods:

PyMARL is written in PyTorch and uses SMAC as its environment.

Installation instructions

Install Python packages

# requires Anaconda 3 or Miniconda 3
bash install_dependecies.sh

Set up StarCraft II and SMAC:

bash install_sc2.sh

This downloads SC2 into the 3rdparty folder and copies over the maps needed to run the experiments.

Run an experiment

# For SMAC
python3 src/main.py --config=qmix --env-config=sc2 with env_args.map_name=corridor

# For Cooperative Predator-Prey
python3 src/main.py --config=qmix_prey --env-config=stag_hunt with env_args.map_name=stag_hunt

The config files act as defaults for an algorithm or environment.

They are all located in src/config: --config refers to the config files in src/config/algs, and --env-config refers to the config files in src/config/envs. Any key=value pair given after with overrides the corresponding default for that run.
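
The layering of defaults and command-line overrides can be pictured with the short sketch below. It is an illustration only, not the code in src/main.py: the helper names (recursive_update, parse_override, build_config) and the dictionary values are assumptions standing in for the real YAML files, and override values are kept as plain strings for simplicity.

```python
import copy


def recursive_update(base, override):
    """Merge override into base in place, descending into nested dicts."""
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(base.get(key), dict):
            recursive_update(base[key], value)
        else:
            base[key] = value
    return base


def parse_override(arg):
    """Turn 'env_args.map_name=corridor' into {'env_args': {'map_name': 'corridor'}}."""
    dotted_key, value = arg.split("=", 1)
    nested = value
    for part in reversed(dotted_key.split(".")):
        nested = {part: nested}
    return nested


def build_config(alg_defaults, env_defaults, overrides):
    """Layer env defaults, then algorithm defaults, then command-line overrides."""
    config = copy.deepcopy(env_defaults)
    recursive_update(config, copy.deepcopy(alg_defaults))
    for arg in overrides:
        recursive_update(config, parse_override(arg))
    return config


# Hypothetical stand-ins for src/config/algs/qmix.yaml and src/config/envs/sc2.yaml;
# the values are illustrative, not the repository's real defaults.
alg_defaults = {"name": "qmix", "epsilon_anneal_time": 100000}
env_defaults = {"env": "sc2", "env_args": {"map_name": "3m", "difficulty": "7"}}

config = build_config(alg_defaults, env_defaults, ["env_args.map_name=corridor"])
print(config)
```

Only env_args.map_name changes here; every key that is not overridden keeps the value from its config file.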

Run parallel experiments:

# bash run.sh config_name map_name_list (threads_num arg_list gpu_list experiments_num)
bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5

Each xxx_list argument is a comma-separated list (for example, 0,1 for gpu_list).

All results are stored in the Results folder and named after map_name.
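
One plausible reading of how these arguments expand into individual runs is sketched below. It is a guess at the semantics, not a transcription of run.sh: the function plan_runs, the round-robin GPU assignment, and the hard-coded --env-config=sc2 are all assumptions, and threads_num (presumably the cap on concurrent processes) is left out of the sketch.

```python
from itertools import cycle


def plan_runs(config_name, map_name_list, arg_list, gpu_list, experiments_num):
    """Return (gpu, command) pairs: one per map and repetition.

    Assumption: GPUs are handed out round-robin across all runs.
    """
    gpu_cycle = cycle(gpu_list.split(","))
    # Comma-separated key=value overrides become space-separated arguments.
    extra = " ".join(a for a in arg_list.split(",") if a)
    runs = []
    for map_name in map_name_list.split(","):
        for _ in range(experiments_num):
            command = (
                f"python3 src/main.py --config={config_name} --env-config=sc2 "
                f"with env_args.map_name={map_name} {extra}"
            ).strip()
            runs.append((next(gpu_cycle), command))
    return runs


# Mirrors: bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5
runs = plan_runs("qmix", "corridor", "epsilon_anneal_time=500000", "0,1", 5)
for gpu, command in runs:
    print(f"GPU {gpu}: {command}")
```

Under these assumptions the example command yields five runs of the same configuration on corridor, alternating between GPUs 0 and 1.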

Force all processes to exit

# All Python and game processes of the current user will be terminated.
bash clean.sh

Some test results on Super Hard scenarios

Cite

@article{hu2021riit,
      title={RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning}, 
      author={Jian Hu and Haibin Wu and Seth Austin Harding and Siyang Jiang and Shih-wei Liao},
      year={2021},
      eprint={2102.03479},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
