Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

Text-to-Image generation

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

TorchGeo is a PyTorch domain library, similar to torchvision, that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data.

code for "Feature Importance-aware Transferable Adversarial Attacks"

A torch implementation of "Pixel-Level Domain Transfer"

GANimation: Anatomically-aware Facial Animation from a Single Image (ECCV'18 Oral) [PyTorch]

Kroomsa: A search engine for the curious

PyTorch code for training MM-DistillNet for multimodal knowledge distillation

Parametric Contrastive Learning (ICCV2021)

The code for paper "Learning Implicit Fields for Generative Shape Modeling".

AI-generated-characters for Learning and Wellbeing

This code is for our paper "VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers"

Face Identity Disentanglement via Latent Space Mapping [SIGGRAPH ASIA 2020]

An implementation of a discriminant function over a normal distribution to help classify datasets.

Dataset Condensation with Contrastive Signals

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

CARL provides highly configurable contextual extensions to several well-known RL environments.

Neural network for digit classification powered by cuda

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)