Multi-Objective Reinforced Active Learning

Dependencies

wandb
tqdm
pytorch >= 1.7.0
numpy >= 1.20.0
scipy >= 1.1.0
pycolab == 1.2

Weights and Biases

Our code depends on for visualizing and logging results during training. As a result, we call wandb.init(), which will prompt to add an API key for linking the training runs with your personal wandb account. This can be done by pasting the WANDB_API_KEY into the respective box when running the code for the first time.

Environments

Our gridworlds (Emergency: randomized_v2.py, Delivery: randomized_v3.py) build on the game engine with a custom wrapper to provide similar functionality as the gym . This engine comes with a user interface and any environment can be played in the console using python environment.py with arrow keys and w, a, s, d as controls.

Training

There are four training scripts for

manually training a PPO agent on custom rewards (ppo_train.py),
training AIRL on a single expert dataset (airl_train.py),
active MORL with custom/automatic preferences (moral_train.py) and
training DRLHP with custom/automatic preferences (drlhp_train.py).

When using automatic preferences, a desired ratio can be passed as an argument. For example,

python moral_train.py --ratio a b c

will run MORAL using a (real-valued) ratio of a:b:c among the three explicit objectives in Delivery.

Hyperparameters

Hyperparameters are passed as arguments to wandb.init() and can be changed by modifying the respective training files.

Multi-Objective Reinforced Active Learning

Related tags

Overview

Multi-Objective Reinforced Active Learning

Dependencies

Weights and Biases

Environments

Training

Hyperparameters

Owner

Markus Peschl

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Unsupervised captioning - Code for Unsupervised Image Captioning

Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Implementation of SiameseXML (ICML 2021)

DRIFT is a tool for Diachronic Analysis of Scientific Literature.

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

Privacy-Preserving Portrait Matting [ACM MM-21]

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".

Fast Neural Representations for Direct Volume Rendering

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

How to Learn a Domain Adaptive Event Simulator? ACM MM, 2021

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

Genetic Programming in Python, with a scikit-learn inspired API

Deeprl - Standard DQN and dueling network for simple games

darija <-> english dictionary

LERP : Label-dependent and event-guided interpretable disease risk prediction using EHRs

A PyTorch Implementation of FaceBoxes