Multi-Objective Reinforced Active Learning

Last update: Nov 19, 2022

Dependencies

wandb
tqdm
pytorch >= 1.7.0
numpy >= 1.20.0
scipy >= 1.1.0
pycolab == 1.2

Weights and Biases

Our code depends on Weights and Biases (wandb) for visualizing and logging results during training. As a result, we call wandb.init(), which prompts for an API key to link the training runs with your personal wandb account. This can be done by pasting the WANDB_API_KEY into the respective box when running the code for the first time.
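To skip the interactive prompt, the key can also be supplied through the WANDB_API_KEY environment variable. The following is a minimal sketch, not code from this repository: the helper names choose_wandb_mode and start_run and the project name are hypothetical. It picks a value for the mode argument of wandb.init() depending on whether a key is present.

```python
import os


def choose_wandb_mode(env=None):
    """Return "online" if an API key is available, otherwise "offline".

    Both strings are accepted by the `mode` argument of wandb.init().
    """
    env = os.environ if env is None else env
    return "online" if env.get("WANDB_API_KEY") else "offline"


def start_run(project="moral-demo"):  # hypothetical project name
    # Imported lazily so this file can be loaded without wandb installed.
    import wandb

    return wandb.init(project=project, mode=choose_wandb_mode())
```

Runs started in offline mode are stored locally and can be uploaded later with wandb sync.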

Environments

Our gridworlds (Emergency: randomized_v2.py, Delivery: randomized_v3.py) build on the pycolab game engine with a custom wrapper that provides functionality similar to gym environments. The engine comes with a user interface, and any environment can be played in the console by running python environment.py, using the arrow keys and w, a, s, d as controls.
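To illustrate what a gym-style wrapper around such an engine can look like, here is a minimal sketch. It is not the wrapper shipped in this repository. It assumes an engine object exposing its_showtime(), play(action) and game_over, which is the interface of pycolab's Engine as far as we are aware. GymStyleWrapper and CountdownGame are hypothetical names, and CountdownGame is only a stand-in so the example runs without pycolab.

```python
class GymStyleWrapper:
    """Expose reset()/step() on top of a pycolab-style engine (sketch)."""

    def __init__(self, make_game):
        # make_game: zero-argument callable returning a fresh engine.
        self._make_game = make_game
        self._game = None

    def reset(self):
        self._game = self._make_game()
        observation, _, _ = self._game.its_showtime()
        return observation

    def step(self, action):
        observation, reward, _ = self._game.play(action)
        # The engine may report no reward for a step; treat that as zero.
        reward = 0.0 if reward is None else reward
        return observation, reward, self._game.game_over, {}


class CountdownGame:
    """Hypothetical stand-in engine: ends after `steps` actions."""

    def __init__(self, steps=2):
        self._left = steps

    @property
    def game_over(self):
        return self._left <= 0

    def its_showtime(self):
        return self._left, None, 1.0

    def play(self, action):
        self._left -= 1
        return self._left, (1.0 if self.game_over else None), 1.0


env = GymStyleWrapper(CountdownGame)
first_obs = env.reset()
transitions = [env.step(0), env.step(0)]
```

Passing a factory rather than an engine instance lets reset() build a fresh game for every episode.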

Training

There are four training scripts for

manually training a PPO agent on custom rewards (ppo_train.py),
training AIRL on a single expert dataset (airl_train.py),
active MORL with custom/automatic preferences (moral_train.py) and
training DRLHP with custom/automatic preferences (drlhp_train.py).

When using automatic preferences, a desired ratio can be passed as an argument. For example,

python moral_train.py --ratio a b c

will run MORAL using a (real-valued) ratio of a:b:c among the three explicit objectives in Delivery.
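As an illustration of what a ratio a:b:c means, the sketch below parses three real values from a --ratio flag and rescales them to relative weights that sum to one. This is an assumption made for illustration only: how moral_train.py actually processes the ratio is not described here, and the function names parse_ratio and normalize are hypothetical.

```python
import argparse


def parse_ratio(argv=None):
    """Parse three real-valued entries from a --ratio flag."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--ratio", nargs=3, type=float, metavar=("A", "B", "C"))
    return parser.parse_args(argv).ratio


def normalize(ratio):
    """Rescale a non-negative ratio so that its entries sum to one."""
    total = sum(ratio)
    if total <= 0 or any(r < 0 for r in ratio):
        raise ValueError("ratio entries must be non-negative and not all zero")
    return [r / total for r in ratio]


weights = normalize(parse_ratio(["--ratio", "1", "1", "2"]))
```

Under this reading, the ratios 1:1:2 and 2:2:4 describe the same preference, since only the relative sizes matter.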

Hyperparameters

Hyperparameters are passed as arguments to wandb.init() and can be changed by modifying the respective training files.
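A minimal sketch of this pattern is shown below. The hyperparameter names, their values, the project name and the helper functions are all hypothetical and do not come from the training files. The sketch only shows how a dictionary can be handed to the config argument of wandb.init().

```python
# Hypothetical hyperparameter names and values, for illustration only.
DEFAULT_CONFIG = {"lr": 3e-4, "gamma": 0.99, "batch_size": 32}


def build_config(overrides=None):
    """Return a copy of the defaults with any overrides applied."""
    config = dict(DEFAULT_CONFIG)
    config.update(overrides or {})
    return config


def init_run_with_config(overrides=None):
    # Imported lazily so this file can be loaded without wandb installed.
    import wandb

    # mode="disabled" turns logging into a no-op, so no API key is needed here.
    return wandb.init(
        project="moral-demo",  # hypothetical project name
        config=build_config(overrides),
        mode="disabled",
    )
```

After initialization, the values can be read back from the config attribute of the returned run.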

Owner

Markus Peschl
