Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Last update: Dec 23, 2021

Related tags

Overview

ALPHAMEPOL

This repository contains the implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Installation

In order to use this codebase you need to work with a Python version >= 3.6. Moreover, you need to have a working setup of Mujoco with a valid Mujco license. To setup Mujoco, have a look here. To avoid any conflict with your existing Python setup, and to keep this project self-contained, it is suggested to work in a virtual environment with virtualenv. To install virtualenv:

pip install --upgrade virtualenv

Create a virtual environment, activate it and install the requirements:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Usage

Unsupervised Pre-Training

To reproduce the Unsupervised Pre-Training experiments in the paper, run:

./scripts/exploration/[gridworld_with_slope.sh | multigrid.sh | ant.sh | minigrid.sh]

Supervised Fine-Tuning

To reproduce the Supervised Fine-Tuning experiments, run:

./scripts/goal_rl/[gridworld_with_slope.sh | multigrid.sh | ant.sh | minigrid.sh]

By default, this will launch TRPO with ALPHAMEPOL initialization. To launch TRPO with a random initialization, simply omit the policy_init argument in the scripts.

Moreover, note that the scripts for the GridWorld with Slope and MultiGrid experiments have the argument num_goals = 50, meaning that the training will be performed with one goal at a time. If you want to speed up the process, you can use several processes (ideally one for each goal), by passing as argument num_goals = 1 and changing incrementally the seed. As regards the Ant and MiniGrid experiments, since the goals are predefined, you can also set the goal_index argument to specify a goal (from 0 to 7 and from 0 to 12 respectively).

Results Visualization

Once launched, each experiment will log statistics in the results folder. You can visualize everything by launching tensorboard targeting that directory:

python -m tensorboard.main --logdir=./results --port 8080

and visiting the board at http://localhost:8080.

Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Related tags

Overview

ALPHAMEPOL

Installation

Usage

Unsupervised Pre-Training

Supervised Fine-Tuning

Results Visualization

Owner

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Python implementation of MULTIseq barcode alignment using fuzzy string matching and GMM barcode assignment

Copy Paste positive polyp using poisson image blending for medical image segmentation

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Graph Convolutional Networks in PyTorch

Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks

Official code for the ICLR 2021 paper Neural ODE Processes

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

TransCD: Scene Change Detection via Transformer-based Architecture

DLL: Direct Lidar Localization

A 35mm camera, based on the Canonet G-III QL17 rangefinder, simulated in Python.

Keeper for Ricochet Protocol, implemented with Apache Airflow

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Neural style transfer as a class in PyTorch

An abstraction layer for mathematical optimization solvers.

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

Disentangled Lifespan Face Synthesis

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Related tags

Overview

ALPHAMEPOL

Installation

Usage

Unsupervised Pre-Training

Supervised Fine-Tuning

Results Visualization

Owner

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Python implementation of MULTIseq barcode alignment using fuzzy string matching and GMM barcode assignment

Copy Paste positive polyp using poisson image blending for medical image segmentation

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Graph Convolutional Networks in PyTorch

Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks

Official code for the ICLR 2021 paper Neural ODE Processes

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

TransCD: Scene Change Detection via Transformer-based Architecture

DLL: Direct Lidar Localization

A 35mm camera, based on the Canonet G-III QL17 rangefinder, simulated in Python.

Keeper for Ricochet Protocol, implemented with Apache Airflow

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Neural style transfer as a class in PyTorch

An abstraction layer for mathematical optimization solvers.

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

Disentangled Lifespan Face Synthesis

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.