A PyTorch implementation of the multi-agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

This is my implementation of the algorithm presented in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments". You can find this paper here: https://arxiv.org/pdf/1706.02275.pdf

You will need to install the Multi-Agent Particle Environment (MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as the latest version at the time of writing (1.8) seems to have an issue with an in-place operation I use in the calculation of the critic loss.
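If you want to confirm which PyTorch version your virtual environment actually picked up, something like the following may help. This is only a rough sketch: the helper names (`version_tuple`, `is_recommended`, `installed_torch_version`, `check_torch`) are made up for illustration and are not part of this repo. The only PyTorch attribute it relies on is `torch.__version__`.

```python
import re

RECOMMENDED_TORCH = (1, 4, 0)  # the version recommended in this README


def version_tuple(version):
    """Turn a version string such as '1.8.0+cu111' into (1, 8, 0)."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", str(version))
    if match is None:
        raise ValueError("unrecognized version string: %r" % (version,))
    # A missing patch component (e.g. '1.4') is treated as 0.
    return tuple(int(part or 0) for part in match.groups())


def is_recommended(version):
    """True if the version string matches the recommended release."""
    return version_tuple(version) == RECOMMENDED_TORCH


def installed_torch_version():
    """Return the installed PyTorch version string, or None if absent."""
    try:
        import torch
    except ImportError:
        return None
    return str(torch.__version__)


def check_torch():
    """Build a short, human-readable report about the local PyTorch install."""
    found = installed_torch_version()
    if found is None:
        return "PyTorch is not installed in this environment."
    if is_recommended(found):
        return "PyTorch %s matches the recommended version." % found
    return "PyTorch %s found; this code was written against 1.4.0." % found
```

Calling `print(check_torch())` inside the activated environment should tell you whether you are on the recommended version before you start a training run.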

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.
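When the main file sits in the same directory as the MAPE checkout, a plain `from make_env import make_env` should be enough. If you keep the two repos in separate folders instead, one possible workaround is to add the MAPE folder to the import path first. The sketch below assumes the MAPE checkout has a `make_env.py` at its top level; `load_make_env` and the example path in the comments are hypothetical and not part of this repo.

```python
import importlib
import sys
from pathlib import Path


def load_make_env(mape_dir="."):
    """Import MAPE's make_env function from the given checkout directory."""
    mape_dir = Path(mape_dir).resolve()
    if not (mape_dir / "make_env.py").is_file():
        raise FileNotFoundError("no make_env.py found in %s" % mape_dir)
    # Make the MAPE checkout importable without moving any files around.
    if str(mape_dir) not in sys.path:
        sys.path.insert(0, str(mape_dir))
    module = importlib.import_module("make_env")
    return module.make_env


# Example usage (the path is a placeholder; adjust it to your own layout):
#   make_env = load_make_env("../multiagent-particle-envs")
#   env = make_env("simple_adversary")  # a scenario that ships with MAPE
```

Cloning this repo into the MAPE directory remains the simpler option; the helper is only useful if you prefer to keep the two checkouts apart.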

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

Owner

Phil Tabor
