Inferring Lexicographically-Ordered Rewards from Preferences

Code author: Alihan Hüyük ([email protected])

This repository contains the source code needed to replicate the main experimental results of the AAAI 2022 paper "Inferring Lexicographically-Ordered Rewards from Preferences." Our proposed method, LORI, is implemented in src/main-lori.py and src/main-lori-liver.py for the two problem settings considered in the paper: cancer treatment and organ transplantation, respectively.
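For readers unfamiliar with the term, the snippet below is a minimal sketch of what a lexicographic ordering over reward vectors means: earlier reward components take priority, and later ones only break ties. It is an illustration only and is not taken from src/main-lori.py; the function name `lex_compare` and the tie tolerance `tol` are assumptions introduced here.

```python
from typing import Sequence


def lex_compare(a: Sequence[float], b: Sequence[float], tol: float = 0.0) -> int:
    """Compare two reward vectors lexicographically.

    Returns 1 if `a` is preferred, -1 if `b` is preferred, and 0 on a tie.
    `tol` is a hypothetical tolerance: differences no larger than `tol` in a
    component are treated as ties, so the next component decides.
    """
    if len(a) != len(b):
        raise ValueError("reward vectors must have the same length")
    for ra, rb in zip(a, b):
        if ra - rb > tol:   # a is clearly better on this component
            return 1
        if rb - ra > tol:   # b is clearly better on this component
            return -1
        # otherwise: tie on this component, fall through to the next one
    return 0
```

With `tol=0.0` this reduces to the strict ordering Python already applies when comparing tuples; a positive `tol` lets a lower-priority reward decide when the higher-priority rewards are close.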

Usage

First, install the required Python packages by running:

    python -m pip install -r requirements.txt

Then, the experiments in the paper can be replicated by running:

    ./src/run.sh        # generates the results in Tables 2 and 3
    ./src/run-liver.sh  # generates the reward functions in (10) and (11)

Note that running the experiments for the transplantation setting requires access to the Organ Procurement and Transplantation Network (OPTN) dataset for liver transplantations as of December 4, 2020.

Citing

If you use this software, please cite as follows:

@inproceedings{huyuk2022inferring,
  author={Alihan H\"uy\"uk and William R. Zame and Mihaela van der Schaar},
  title={Inferring lexicographically-ordered rewards from preferences},
  booktitle={Proceedings of the 36th AAAI Conference on Artificial Intelligence},
  year={2022}
}
