Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Last update: Dec 07, 2022

Overview

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

About

This repository contains the code to replicate the synthetic experiment conducted in the paper "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" by Haruka Kiyohara, Yuta Saito, Tatsuya Matsuhiro, Yusuke Narita, Nobuyuki Shimizu, and Yasuo Yamamoto, which has been accepted to WSDM2022.

If you find this code useful in your research then please site:

@inproceedings{kiyohara2022doubly,
  author = {Kiyohara, Haruka and Saito, Yuta and Matsuhiro, Tatsuya and Narita, Yusuke and Shimizu, Nobuyuki and Yamamoto, Yasuo},
  title = {Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model},
  booktitle = {Proceedings of the 15th International Conference on Web Search and Data Mining},
  pages = {xxx--xxx},
  year = {2022},
}

Dependencies

This repository supports Python 3.7 or newer.

numpy==1.20.0
pandas==1.2.1
scikit-learn==0.24.1
matplotlib==3.4.3
obp==0.5.2
hydra-core==1.0.6

Note that the proposed Cascade-DR estimator is implemented in Open Bandit Pipeline (obp.ope.SlateCascadeDoublyRobust).

Running the code

To conduct the synthetic experiment, run the following commands.

(i) run OPE simulations with varying data size, with the fixed slate size.

python src/main.py setting=n_rounds

(ii), (iii) run OPE simulations with varying slate size and policy similarities, with the fixed data size.

python src/main.py

Once the code is finished executing, you can find the results (squared_error.csv, relative_ee.csv, configuration.csv) in the ./logs/ directory. Lower value is better for squared error and relative estimation error (relative-ee).

Visualize the results

To visualize the results, run the following commands. Make sure that you have executed the above two experiments (by running python src/main.py and python src/main.py setting=default) before visualizing the results.

python src/visualize.py

Then, you will find the following figures (slate size (standard/cascade/independent).png, evaluation policy similarity (standard/cascade/independent).png, data size (standard/cascade/independent).png) in the ./logs/ directory. Lower value is better for the relative-MSE (y-axis).

reward structure	Standard	Cascade	Independent
varying data size (n)
varying slate size (L)
varying evaluation policy similarity (λ)

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Related tags

Overview

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

About

Dependencies

Running the code

Visualize the results

Owner

Haruka Kiyohara

Emulation and Feedback Fuzzing of Firmware with Memory Sanitization

Official PaddlePaddle implementation of Paint Transformer

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

Learn other languages using artificial intelligence with python.

Code samples for my book "Neural Networks and Deep Learning"

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Code Repository for The Kaggle Book, Published by Packt Publishing

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Adabelief-Optimizer - Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Deeplearning project at The Technological University of Denmark (DTU) about Neural ODEs for finding dynamics in ordinary differential equations and real world time series data

Perception-aware multi-sensor fusion for 3D LiDAR semantic segmentation (ICCV 2021)

Tesla Light Show xLights Guide With python

Open source person re-identification library in python

YOLOv5 🚀 is a family of object detection architectures and models pretrained on the COCO dataset

Emotional conditioned music generation using transformer-based model.

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Related tags

Overview

Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

About

Dependencies

Running the code

Visualize the results

Owner

Haruka Kiyohara

Emulation and Feedback Fuzzing of Firmware with Memory Sanitization

Official PaddlePaddle implementation of Paint Transformer

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

Learn other languages ​​using artificial intelligence with python.

Code samples for my book "Neural Networks and Deep Learning"

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Code Repository for The Kaggle Book, Published by Packt Publishing

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Adabelief-Optimizer - Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Deeplearning project at The Technological University of Denmark (DTU) about Neural ODEs for finding dynamics in ordinary differential equations and real world time series data

Perception-aware multi-sensor fusion for 3D LiDAR semantic segmentation (ICCV 2021)

Tesla Light Show xLights Guide With python

Open source person re-identification library in python

YOLOv5 🚀 is a family of object detection architectures and models pretrained on the COCO dataset

Emotional conditioned music generation using transformer-based model.

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

Learn other languages using artificial intelligence with python.