Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Last update: Dec 03, 2022

Related tags

Overview

Hierarchical reinforcement learning with Timed Subgoals (HiTS)

This repository contains code for reproducing experiments from our paper "Hierarchical reinforcement learning with Timed Subgoals". The implementation of the Hierarchical reinforcement learning with Timed Subgoals (HiTS) algorithm can be found in the Graph-RL repository.

HiTS enables sample-efficient learning in sparse-reward, long-horizong tasks. In particular, it extends subgoal-based hierarchical reinforcement learning to environments with dynamic elements which are, most of the time, beyond the control of the agent. Due to the use of timed subgoals and hindsight action relabeling the higher level sees transitions that are consistent with a stationary effective environment. As a result both levels in the hierarchy can learn concurrently and efficiently.

The three benchmark tasks in dynamic environments from the paper are contained in the dynamic-rl-benchmarks repository. If you are interested in applying HiTS to a different task, then this demo in the Graph-RL repository is the best place to start.

Installation

We recommend using a virtual environment with python3.7 or higher. Make sure pip is up to date. In the root directory of the repository execute:

pip install -r requirements.txt

Usage

To render episodes with one of the pretrained policies execute in the root directory:

python -m scripts.run.render --algo hits --env Platforms

Available algorithms:

hits
hac
sac

Available environments:

AntFourRooms
Drawbridge
Pendulum
Platforms
Tennis2D
UR5Reacher

A policy can be be trained from scratch by running:

python -m scripts.run.train --algo hits --env Platforms

To render episodes with a newly trained policy use:

python -m scripts.run.render --algo hits --env Platforms --newly_trained

To render an episode with the stochastic policy used during training:

python -m scripts.run.render --algo hits --env Platforms --newly_trained --stochastic

Hyperparameters and seeds can be found in the graph_params.json files in the data directory. The key level_params_list contains a list of the hyperparameters of all levels, starting with the lowest level.

How to cite

Please use the following BibTex entry.

@article{gurtler2021hierarchical,
  title={Hierarchical Reinforcement Learning with Timed Subgoals},
  author={G{\"u}rtler, Nico and B{\"u}chler, Dieter and Martius, Georg},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Related tags

Overview

Hierarchical reinforcement learning with Timed Subgoals (HiTS)

Installation

Usage

How to cite

Owner

Autonomous Learning Group

Image Deblurring using Generative Adversarial Networks

Optimize Trading Strategies Using Freqtrade

This is a deep learning-based method to segment deep brain structures and a brain mask from T1 weighted MRI.

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Official Implementation of "Transformers Can Do Bayesian Inference"

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation Official PyTorch Implementation

Used to record WKU's utility bills on a regular basis.

Laplace Redux -- Effortless Bayesian Deep Learning

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays

This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019.

Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

Experiments with differentiable stacks and queues in PyTorch

This repository provides an efficient PyTorch-based library for training deep models.

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

The pure and clear PyTorch Distributed Training Framework.

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Prior-Guided Multi-View 3D Head Reconstruction

FlingBot: The Unreasonable Effectiveness of Dynamic Manipulations for Cloth Unfolding