A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients(MADDPG) algorithm

This is my implementation of the algorithm presented in the paper: Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments. You can find this paper here: https://arxiv.org/pdf/1706.02275.pdf

You will need to install the Multi Agent Particle Environment(MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as the latest version (1.8) seems to have an issue with an in place operation I use in the calculation of the critic loss.

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Related tags

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

Owner

Phil Tabor

A Protein-RNA Interface Predictor Based on Semantics of Sequences

Code for the bachelors-thesis flaky fault localization

Code for our CVPR 2021 paper "MetaCam+DSCE"

Code for: Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space. Nicholas Monath, Manzil Zaheer, Daniel Silva, Andrew McCallum, Amr Ahmed. KDD 2019.

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

XViT - Space-time Mixing Attention for Video Transformer

Covid19-Forecasting - An interactive website that tracks, models and predicts COVID-19 Cases

COVINS -- A Framework for Collaborative Visual-Inertial SLAM and Multi-Agent 3D Mapping

ZeroVL - The official implementation of ZeroVL

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

MEND: Model Editing Networks using Gradient Decomposition

Bayes-Newton—A Gaussian process library in JAX, with a unifying view of approximate Bayesian inference as variants of Newton's algorithm.

Official implementation of Densely connected normalizing flows

H&M Fashion Image similarity search with Weaviate and DocArray

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

Gradient Step Denoiser for convergent Plug-and-Play

Learning Tracking Representations via Dual-Branch Fully Transformer Networks

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

This repository contains tutorials for the py4DSTEM Python package

ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a large-scale learning-from-demonstrations benchmark for object manipulation.