Inferring Lexicographically-Ordered Rewards from Preferences

Code author: Alihan Hüyük ([email protected])

This repository contains the source code necessary to replicate the main experimental results in the AAAI 2022 paper "Inferring Lexicographically-Ordered Reward from Preferences." Our proposed method, LORI, is implemented in files src/main-lori.py and src/main-lori-liver.py for the problem settings considered in the paper: cancer treatment and organ transplantation respectively.

Usage

First, install the required python packages by running:

    python -m pip install -r requirements.txt

Then, the experiments in the paper can be replicated by running:

    ./src/run.sh        # generates the results in Tables 2 and 3
    ./src/run-liver.sh  # generates the reward functions in (10) and (11)

Note that, in order to run the experiments for the transplantation setting, you need to get access to the Organ Procurement and Transplantation Network (OPTN) dataset for liver transplantations as of December 4, 2020.

Citing

If you use this software please cite as follows:

@inproceedings{huyuk2022inferring,
  author={Alihan H\"uy\"uk and William R. Zame and Mihaela van der Schaar},
  title={Inferring lexicographically-ordered rewards from preferences},
  booktitle={Proceedings of the 36th AAAI Conference on Artificial Intelligence},
  year={2022}
}

Inferring Lexicographically-Ordered Rewards from Preferences

Related tags

Overview

Inferring Lexicographically-Ordered Rewards from Preferences

Usage

Citing

Owner

Alihan Hüyük

A Closer Look at Structured Pruning for Neural Network Compression

Neural network chess engine trained on Gary Kasparov's games.

deep learning model that learns to code with drawing in the Processing language

A toy compiler that can convert Python scripts to pickle bytecode 🥒

Extension to fastai for volumetric medical data

Adds timm pretrained backbone to pytorch's FasterRcnn model

Everything you need to know about NumPy( Creating Arrays, Indexing, Math,Statistics,Reshaping).

A GOOD REPRESENTATION DETECTS NOISY LABELS

🚗 INGI Dakar 2K21 - Be the first one on the finish line ! 🚗

S2s2net - Sentinel-2 Super-Resolution Segmentation Network

HyperSeg: Patch-wise Hypernetwork for Real-time Semantic Segmentation Official PyTorch Implementation

Code repository for the paper "Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation" with instructions to reproduce the results.

FlingBot: The Unreasonable Effectiveness of Dynamic Manipulations for Cloth Unfolding

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Hysterese plugin with two temperature offset areas

Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules)

Uncertain natural language inference

PyTorch implementation of Deep HDR Imaging via A Non-Local Network (TIP 2020).

[ ICCV 2021 Oral ] Our method can estimate camera poses and neural radiance fields jointly when the cameras are initialized at random poses in complex scenarios (outside-in scenes, even with less texture or intense noise )

Explanatory Learning: Beyond Empiricism in Neural Networks