Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Last update: Sep 19, 2022


This is the official codebase for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL". It provides a sample implementation of SAFARI on the cooperative navigation environment. This repository itself is untested; however, many of the files match exactly the code used to run the experiments in the paper. For the implementation, refer to agents/safari.py.

Requirements

To install requirements, run:

pip install -r requirements.txt

requirements.txt may list some packages that are not used, but every dependency the code needs is included in it.

Run

To start a training run of SAFARI, add a dataset to the data/ folder, then run:

python main.py safari

This starts training from the entry point, main.py.

Data Format

SAFARI expects a dataset to be present at data/ / for each parallel seed that is run. Each dataset consists of three files:

actions.txt (Shape: [N, H])
rewards.txt (Shape: [N, H])
obs.txt (Shape: [N, H, O])

In each file, every line holds one episodic trajectory. To produce a file, we (1) convert each buffer into a list, (2) cast it to str, and (3) print it on its own line of the file.
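
The three steps above can be sketched as follows. This is a minimal illustration and not code from the repository: the helper names write_buffers and read_buffers are hypothetical, the toy values are made up, and the shape of each line is an assumption, since the text does not say which dimensions a single line carries. Reading the lines back with ast.literal_eval is likewise only one plausible way to parse them; the loader in this codebase may do it differently.

```python
import ast
import os
import tempfile


def write_buffers(path, buffers):
    """Write one buffer per line (hypothetical helper, not from the repo)."""
    with open(path, "w") as f:
        for buf in buffers:
            # (1) convert the buffer into a plain list;
            #     NumPy arrays expose tolist(), other sequences use list()
            as_list = buf.tolist() if hasattr(buf, "tolist") else list(buf)
            # (2) cast the list to str and (3) print it on its own line
            print(str(as_list), file=f)


def read_buffers(path):
    """Parse each non-empty line back into a nested Python list."""
    with open(path) as f:
        return [ast.literal_eval(line) for line in f if line.strip()]


# Toy data (assumed shapes): 2 lines, horizon 3, observation size 2.
actions = [[0, 1, 2], [2, 1, 0]]
rewards = [[0.0, -0.5, 1.0], [0.25, 0.0, -1.0]]
obs = [
    [[0.0, 0.1], [0.2, 0.3], [0.4, 0.5]],
    [[1.0, 1.1], [1.2, 1.3], [1.4, 1.5]],
]

# Write to a temporary directory so the sketch does not touch data/.
out_dir = tempfile.mkdtemp()
for name, data in [("actions", actions), ("rewards", rewards), ("obs", obs)]:
    write_buffers(os.path.join(out_dir, name + ".txt"), data)

loaded_actions = read_buffers(os.path.join(out_dir, "actions.txt"))
loaded_rewards = read_buffers(os.path.join(out_dir, "rewards.txt"))
loaded_obs = read_buffers(os.path.join(out_dir, "obs.txt"))
```

Because each line is the str of a plain list, the files stay human-readable and can be inspected or edited with any text editor.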
