A TensorFlow implementation of SOFA, the Simulator for OFfline leArning and evaluation.

Last update: Nov 23, 2022

Overview

SOFA

This repository is the implementation of SOFA, the Simulator for OFfline leArning and evaluation, introduced in the following paper:

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. RecSys 2020.

In reinforcement learning for recommendation (RL4Rec), a recommender system (RS) typically interacts with a simulation-based environment. A state is the user's historical interactions, an action is an item recommended by the RS, and a reward is related to user feedback.
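The interaction described above can be sketched as a small loop. This is only an illustration: ToySimulator, run_episode, and make_random_policy are hypothetical names, not part of this repository's API, and the feedback model (a fixed click probability per item) is an assumption chosen to keep the example short.

```python
import random


class ToySimulator:
    """Hypothetical stand-in for a simulation-based environment (not SOFA's API)."""

    def __init__(self, click_probs, episode_length=5):
        # click_probs[item] is the assumed probability of positive feedback.
        self.click_probs = click_probs
        self.episode_length = episode_length
        self.history = []

    def reset(self):
        # The state is the user's interaction history, empty at the start.
        self.history = []
        return tuple(self.history)

    def step(self, item, rng):
        # The reward is derived from simulated user feedback on the item.
        reward = 1 if rng.random() < self.click_probs[item] else 0
        self.history.append((item, reward))
        done = len(self.history) >= self.episode_length
        return tuple(self.history), reward, done


def make_random_policy(num_items):
    # A placeholder policy: recommend a uniformly random item.
    def policy(state, rng):
        return rng.randrange(num_items)
    return policy


def run_episode(env, policy, rng):
    # One episode: observe state, recommend an item, receive a reward.
    state = env.reset()
    total_reward, done = 0, False
    while not done:
        action = policy(state, rng)
        state, reward, done = env.step(action, rng)
        total_reward += reward
    return total_reward, state


if __name__ == "__main__":
    rng = random.Random(0)
    env = ToySimulator(click_probs=[0.9, 0.5, 0.1], episode_length=5)
    total, final_state = run_episode(env, make_random_policy(3), rng)
    print("total reward:", total)
    print("final state:", final_state)
```

A learned policy, such as the DQN-based one used in the experiments, would take the place of the random policy in this loop.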

To counter the effect of bias in logged data, we introduce a debiasing step in the simulation pipeline: the logged data is corrected for its biases before it is used to simulate user behavior.
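This README does not spell out how the debiasing step works. As a rough illustration of the idea only, the sketch below assumes inverse propensity scoring, where each logged rating is reweighted by the inverse of the probability that it was observed. The function names and the toy propensities are hypothetical, and in practice the propensities would have to be estimated.

```python
def naive_mean(logged):
    """Average rating over logged (rating, propensity) pairs, ignoring bias."""
    return sum(rating for rating, _ in logged) / len(logged)


def ips_mean(logged):
    """Self-normalized inverse-propensity-weighted average rating.

    Each rating is weighted by 1 / propensity, so ratings that were
    unlikely to be observed count for more.
    """
    weights = [1.0 / propensity for _, propensity in logged]
    weighted_sum = sum(w * rating for w, (rating, _) in zip(weights, logged))
    return weighted_sum / sum(weights)


if __name__ == "__main__":
    # Assumed toy log: high ratings are over-represented because users
    # were more likely to rate items they liked (propensity 0.8 vs 0.2).
    logged = [(5, 0.8), (5, 0.8), (5, 0.8), (5, 0.8), (1, 0.2)]
    print("naive mean:", naive_mean(logged))
    print("IPS mean:", ips_mean(logged))
```

In this toy log the naive average is pulled towards the over-represented high ratings, while the reweighted average is lower because the rarely observed low rating stands in for the similar ratings that were never logged.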

Running the code

$ cd examples
$ python run_dqn.py

More details

We provide the details of the DQN-based policy used in the experiments and the related hyperparameters (see Appendix). We also provide the slides used for the presentation at RecSys 2020.

Cite

If you use our code, please cite our paper:

@inproceedings{huang2020keeping,
  title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
  author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
  booktitle={Fourteenth ACM Conference on Recommender Systems},
  pages={190--199},
  year={2020}
}
