Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

DeLag: Detecting Latency Degradation Patterns in Service-based Systems

Code for paper: "Spinning Language Models for Propaganda-As-A-Service"

TargetAllDomainObjects - A python wrapper to run a command on against all users/computers/DCs of a Windows Domain

Code for ICCV2021 paper SPEC: Seeing People in the Wild with an Estimated Camera

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Representation

[CVPR 2022] Back To Reality: Weak-supervised 3D Object Detection with Shape-guided Label Enhancement

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Code and Experiments for ACL-IJCNLP 2021 Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

Adaptive Graph Convolution for Point Cloud Analysis

Winning solution of the Indoor Location & Navigation Kaggle competition

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Feature board for ERPNext

The BCNet related data and inference model.

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.