Continual Learning in Environments with Polynomial Mixing Times

This repository provides the official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times".

Basic Setup

Clone this repository, then change into its directory:

cd polynomial-mixing-times

Create and activate either a Python virtualenv or a conda environment. For example, with virtualenv:

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a run script, run_bottleneck.sh, with all the relevant hyperparameters used for both the baselines and our proposed models. Running it runs all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution, and a random seed of 1, use the following command:

bash run_bottleneck.sh 1 4 "random"
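
To repeat this run over several seeds, a small wrapper loop is one option. The sketch below assumes the positional argument order suggested by the command above (seed, number of rooms, task evolution). The run_seeds function and the DRY_RUN variable are hypothetical helpers and are not part of the repository. With DRY_RUN=1 it only prints the commands it would launch.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): call run_bottleneck.sh
# once per seed, assuming the argument order <seed> <rooms> <task evolution>.
run_seeds() {
  local rooms="$1" evolution="$2"
  shift 2
  local seed
  for seed in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      # Print the command instead of launching the experiment.
      echo "bash run_bottleneck.sh $seed $rooms \"$evolution\""
    else
      bash run_bottleneck.sh "$seed" "$rooms" "$evolution"
    fi
  done
}

# Preview the commands for seeds 1 to 3 with 4 rooms and "random" task evolution.
DRY_RUN=1 run_seeds 4 random 1 2 3
```

Dropping DRY_RUN=1 from the last line would launch the runs one after another from the repository root.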

Available Models

Online Q-learning
Q-learning with Replay
Q-learning with Dyna
Model-based n-step TD
Vanilla Policy Gradient
On-policy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used Python 3.7 to run all our experiments.
