Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Last update: Jan 01, 2023

Related tags

Overview

Offline Meta-Reinforcement Learning with Advantage Weighting (MACAW)

MACAW code used for the experiments in the ICML 2021 paper.

Installing the environment

# Install Python 3.7.9 if necessary
$ pyenv install 3.7.9
$ pyenv shell 3.7.9

$ python --version
Python 3.7.9

$ python -m venv env
$ source env/bin/activate
$ pip install -r requirements.txt

Downloading the data

The offline data used for MACAW can be found here. Download it and use the default name (macaw_offline_data) for the folder where the four data directories are stored. gDrive might be useful here if downloading from the Google Drive GUI is not an option.

Running MACAW 🦜

Run offline meta-training with periodic online evaluations with any of the scripts in scripts/. e.g.

$ . scripts/macaw_dir.sh # MACAW training on Cheetah-Direction (Figure 1)
$ . scripts/macaw_vel.sh # MACAW training on Cheetah-Velocity (Figure 1)
$ . scripts/macaw_quality_ablation.sh # Data quality ablation (Figure 5-left)
...

Outputs (tensorboard logs) will be written to the log/ directory.

Reach out!

If you're having issues with the code or data, feel free to open an issue or send me an email.

Citation

If our code or research was useful for your own work, you can cite us with the following attribution:

@InProceedings{mitchell2021offline,
    title = {Offline Meta-Reinforcement Learning with Advantage Weighting},
    author = {Mitchell, Eric and Rafailov, Rafael and Peng, Xue Bin and Levine, Sergey and Finn, Chelsea},
    booktitle = {Proceedings of the 38th International Conference on Machine Learning},
    year = {2021}
}

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Related tags

Overview

Offline Meta-Reinforcement Learning with Advantage Weighting (MACAW)

Installing the environment

Downloading the data

Running MACAW 🦜

Reach out!

Citation

Owner

Eric Mitchell

Event-forecasting - Event Forecasting Algorithms With Python

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

TorchMetrics is a collection of 25+ PyTorch metrics implementations and an easy-to-use API to create custom metrics.

meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)

A Japanese Medical Information Extraction Toolkit

JumpDiff: Non-parametric estimator for Jump-diffusion processes for Python

Code to accompany our paper "Continual Learning Through Synaptic Intelligence" ICML 2017

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification

DockStream: A Docking Wrapper to Enhance De Novo Molecular Design

Simple tutorials using Google's TensorFlow Framework

La source de mon module 'pyfade' disponible sur Pypi.

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.

SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expertise

An implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Tensorflow Repo for "DeepGCNs: Can GCNs Go as Deep as CNNs?"

Our CIKM21 Paper "Incorporating Query Reformulating Behavior into Web Search Evaluation"

Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods