Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Last update: Dec 13, 2022

Related tags

Deep Learning auto-drac

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

This is a PyTorch implementation of the methods proposed in

Automatic Data Augmentation for Generalization in Deep Reinforcement Learning by

Roberta Raileanu, Max Goldstein, Denis Yarats, Ilya Kostrikov, and Rob Fergus.

Citation

If you use this code in your own work, please cite our paper:

@article{raileanu2020automatic,
  title={Automatic Data Augmentation for Generalization in Deep Reinforcement Learning},
  author={Raileanu, Roberta and Goldstein, Max and Yarats, Denis and Kostrikov, Ilya and Fergus, Rob},
  journal={arXiv preprint arXiv:2006.12862},
  year={2020}
}

Requirements

The code was run on a GPU with CUDA 10.2. To install all the required dependencies:

conda create -n auto-drac python=3.7
conda activate auto-drac

git clone [email protected]:rraileanu/auto-drac.git
cd auto-drac
pip install -r requirements.txt

git clone https://github.com/openai/baselines.git
cd baselines 
python setup.py install 

pip install procgen

Instructions

cd auto-drac

Train DrAC with crop augmentation on BigFish

python train.py --env_name bigfish --aug_type crop

Train UCB-DrAC on BigFish

python train.py --env_name bigfish --use_ucb

Train RL2-DrAC on BigFish

python train.py --env_name bigfish --use_rl2

Train Meta-DrAC on BigFish

python train.py --env_name bigfish --use_meta

Procgen Results

UCB-DrAC achieves state-of-the-art performance on the Procgen benchmark (easy mode), significantly improving the agent's generalization ability over standard RL methods such as PPO.

Test Results on Procgen

Train Results on Procgen

Agent Videos

You can find some videos of the agent's behavior while training on our website.

Acknowledgements

This code was based on an open sourced PyTorch implementation of PPO.

We also used kornia for some of the augmentations.

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Related tags

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

Citation

Requirements

Instructions

Train DrAC with crop augmentation on BigFish

Train UCB-DrAC on BigFish

Train RL2-DrAC on BigFish

Train Meta-DrAC on BigFish

Procgen Results

Agent Videos

Acknowledgements

Owner

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

Official Pytorch implementation for video neural representation (NeRV)

abess: Fast Best-Subset Selection in Python and R

Nsdf: A mesh SDF with just some code we can directly paste into our raymarcher

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"

PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.

DLWP: Deep Learning Weather Prediction

Multiple Object Extraction from Aerial Imagery with Convolutional Neural Networks

Pytorch implementation of PTNet for high-resolution and longitudinal infant MRI synthesis

Deep Ensemble Learning with Jet-Like architecture

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Self-attentive task GAN for space domain awareness data augmentation.

U^2-Net - Portrait matting This repository explores possibilities of using the original u^2-net model for portrait matting.

Simple image captioning model - CLIP prefix captioning.

Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming soon!

Get a Grip! - A robotic system for remote clinical environments.

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

METER: Multimodal End-to-end TransformER

Offcial implementation of "A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, ICCV-2021".