MD-PGT

Repository for implementing and reproducing the results of the paper "MDPGT: Momentum-based Decentralized Policy Gradient Tracking": https://arxiv.org/abs/2112.02813

Available Environments

Lineworld
Particle-world (installation instructions available at https://github.com/xylee95/MDPGT_particleworld)

Available Agents

DPG: Decentralized Policy Gradients
MDPG: Momentum Decentralized Policy Gradients
MDPGT: Momentum-based Decentralized Policy Gradient Tracking
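
The agents differ in what each one feeds into its parameter update. As a rough illustration only (this is not the repository's code), the sketch below assumes a STORM-style momentum surrogate and a standard gradient-tracking recursion over a mixing matrix `W`. The function names, the importance-weight argument `iw`, and the use of plain lists for parameter vectors are all hypothetical; `update_functions.py` remains the authoritative reference.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def momentum_surrogate(g_new: Vector, g_old: Vector, u_prev: Vector,
                       beta: float, iw: float = 1.0) -> Vector:
    """Assumed form: u = beta*g_new + (1 - beta)*(u_prev + g_new - iw*g_old)."""
    return [beta * gn + (1.0 - beta) * (up + gn - iw * go)
            for gn, go, up in zip(g_new, g_old, u_prev)]


def mix(W: Matrix, rows: List[Vector]) -> List[Vector]:
    """Neighbor averaging: out[i] = sum_j W[i][j] * rows[j]."""
    n, dim = len(rows), len(rows[0])
    return [[sum(W[i][j] * rows[j][d] for j in range(n)) for d in range(dim)]
            for i in range(n)]


def tracking_step(W: Matrix, v_prev: List[Vector],
                  u_new: List[Vector], u_prev: List[Vector]) -> List[Vector]:
    """Assumed tracker: v_i = sum_j W_ij * (v_j_prev + u_j_new - u_j_prev)."""
    corrected = [[v + un - up for v, un, up in zip(v_i, un_i, up_i)]
                 for v_i, un_i, up_i in zip(v_prev, u_new, u_prev)]
    return mix(W, corrected)


def parameter_step(W: Matrix, x: List[Vector], v: List[Vector],
                   lr: float) -> List[Vector]:
    """Assumed ascent step: x_i = sum_j W_ij * (x_j + lr * v_j)."""
    moved = [[xd + lr * vd for xd, vd in zip(x_i, v_i)]
             for x_i, v_i in zip(x, v)]
    return mix(W, moved)
```

Under these assumptions, setting `beta = 1.0` makes the surrogate collapse to the plain gradient estimate `g_new`, which is one way to see how the momentum variants relate to DPG.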

Main files used are:

train_lineworld_dpg.py
train_lineworld_mdpg.py
train_lineworld_mdpgt.py
train_particleworld_dpg.py
train_particleworld_mdpg.py
train_particleworld_mdpgt.py
model.py: code for the policy network and related functions
update_functions.py: all functions related to update rules and consensus for MDPG and MDPGT
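
To make the idea of a consensus step concrete, here is a minimal sketch in which each agent replaces its parameters with a weighted average of its neighbors' parameters. It assumes a ring topology with uniform 1/3 weights, chosen purely for illustration; `ring_mixing_matrix` and `consensus_step` are hypothetical names and are not taken from `update_functions.py`.

```python
from typing import List

Matrix = List[List[float]]


def ring_mixing_matrix(n_agents: int) -> Matrix:
    """Doubly stochastic matrix: each agent averages itself and its two ring neighbors."""
    W = [[0.0] * n_agents for _ in range(n_agents)]
    for i in range(n_agents):
        for j in (i - 1, i, i + 1):
            # Modulo wraps around the ring; += handles rings with fewer than 3 agents.
            W[i][j % n_agents] += 1.0 / 3.0
    return W


def consensus_step(W: Matrix, params: List[List[float]]) -> List[List[float]]:
    """One round of neighbor averaging: new_params[i] = sum_j W[i][j] * params[j]."""
    n, dim = len(params), len(params[0])
    return [[sum(W[i][j] * params[j][d] for j in range(n)) for d in range(dim)]
            for i in range(n)]


if __name__ == "__main__":
    W = ring_mixing_matrix(5)
    params = [[float(i)] for i in range(5)]  # one scalar parameter per agent
    for _ in range(50):
        params = consensus_step(W, params)
    print(params)  # every agent ends up near the network-wide average
```

Because the matrix is doubly stochastic, the average of the agents' parameters is preserved at every round, and repeated averaging drives all agents toward that average.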

Both MDPG and MDPGT have the option of using Minibatch Initialization to compute the batch gradient surrogate.
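
As a hedged illustration of what such an initialization could look like, the sketch below assumes that Minibatch Initialization means averaging policy-gradient estimates from a batch of sampled trajectories to form the initial surrogate. The name `minibatch_init` and the `sample_gradient` callback are hypothetical and stand in for whatever the training scripts actually do.

```python
from typing import Callable, List


def minibatch_init(sample_gradient: Callable[[], List[float]],
                   batch_size: int) -> List[float]:
    """Average `batch_size` independent gradient estimates into one initial surrogate."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    # Each call is assumed to roll out one trajectory and return its gradient estimate.
    grads = [sample_gradient() for _ in range(batch_size)]
    dim = len(grads[0])
    return [sum(g[d] for g in grads) / batch_size for d in range(dim)]
```

With `batch_size = 1` this reduces to using a single trajectory's gradient estimate; larger batches trade extra sampling at the start for a less noisy initial surrogate.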

Reproducing the results:

To reproduce the results shown in the paper, see run_exp.sh for the relevant Python commands.
