Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Last update: Sep 09, 2022

HierarchicyBandit

Introduction

This is the implementation of the WSDM 2022 paper "Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations".
It provides reference code for HCB and pHCB, each built on three different base bandit algorithms:

LinUCB, from "A contextual-bandit approach to personalized news article recommendation"
epsilon-Greedy, which explores at random on an epsilon fraction of the traffic and exploits greedily on the rest
Thompson Sampling, from "Thompson Sampling for Contextual Bandits with Linear Payoffs"
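
The epsilon-Greedy rule described above can be sketched in a few lines. This is a minimal illustration, not the code in algsE/: the function name `epsilon_greedy_choice` and its arguments are assumptions made for this example, and the estimated rewards are taken as given.

```python
import random


def epsilon_greedy_choice(estimated_rewards, epsilon, rng=random):
    """Return the index of the chosen arm (hypothetical helper, not from the repo).

    With probability epsilon, pick an arm uniformly at random (exploration);
    otherwise pick the arm with the highest estimated reward (exploitation).
    """
    if not estimated_rewards:
        raise ValueError("need at least one arm")
    if rng.random() < epsilon:
        return rng.randrange(len(estimated_rewards))
    return max(range(len(estimated_rewards)), key=estimated_rewards.__getitem__)


if __name__ == "__main__":
    scores = [0.1, 0.9, 0.3]
    # epsilon = 0.0 never explores, so the best-scoring arm is always chosen.
    print(epsilon_greedy_choice(scores, epsilon=0.0))
```

Passing a seeded `random.Random` instance as `rng` makes the exploration steps reproducible.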

Files in the folder

data/
- MIND/ and TaoBao/
  - item_info.pkl: processed item file, including item id, item feature and embeddings for simulator;
  - user_info.pkl: processed user file, including user id, and embeddings for simulator;
  - item_info_ts.pkl: processed item file for Thompson sampling;
algs/: implementations of HCB and pHCB based on LinUCB.
algsE/: implementations of HCB and pHCB based on epsilon-Greedy.
algsTS/: implementations of HCB and pHCB based on Thompson Sampling.

Note

Before testing the algorithms, modify the settings in config.py.
For Thompson sampling, we provide an additional set of 16-dimensional feature vectors, since they make the experiments run faster. The original feature vectors also work with the algorithms.
user_info.pkl and item_info.pkl are stored as dictionaries.
The implementation of ConUCB is released at ConUCB. HMAB and ICTRUCB are special cases of CB-Category and CB-Leaf.
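
Since the info files are pickled dictionaries, a quick way to inspect one is sketched below. The helpers `load_info` and `peek` are hypothetical names introduced for this example, and the layout of the dictionary values is not documented here, so the sketch only reports value types. The files may also require the libraries used to create them (for example NumPy) to be installed before they can be unpickled.

```python
import pickle


def load_info(path):
    """Load a pickled info file and check that it is a dictionary."""
    with open(path, "rb") as f:
        info = pickle.load(f)
    if not isinstance(info, dict):
        raise TypeError(f"expected a dict in {path}, got {type(info).__name__}")
    return info


def peek(info, n=3):
    """Return the first n (key, value-type-name) pairs for a quick look."""
    return [(key, type(value).__name__) for key, value in list(info.items())[:n]]
```

For example, `peek(load_info("data/MIND/user_info.pkl"))` lists a few keys and the types of their values. Only unpickle files from a source you trust, because loading a pickle can execute arbitrary code.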

Usage:

Download HierarchicyBandit.zip and unzip it. You will get five folders: algs/, algsE/, algsTS/, data/, and logger/.

Parameters:
The config.py file contains:

dataset: the dataset used in the experiment, either 'MIND' or 'TaoBao';  
T: the number of rounds each bandit algorithm runs;  
k: the number of items recommended to the user in each round (default 1);  
activate_num: the hyper-parameter p for pHCB;  
activate_prob: the hyper-parameter q for pHCB;  
epsilon: the epsilon value for the greedy-based algorithms;  
new_tree_file: the tree file name;  
noise_scale: the standard deviation of the environmental noise;  
keep_prob: the user sample ratio; the default 1.0 tests all users;  
linucb_para: the hyper-parameters for the LinUCB algorithm;  
ts_para: the hyper-parameters for the Thompson sampling algorithm;  
poolsize: the size of the candidate pool;  
random_choice: whether to recommend a randomly chosen item to the user.
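
As a rough illustration of how these settings fit together, the sketch below mirrors the parameter names in a dataclass and adds a few sanity checks. It is not the repository's config.py: the class `BanditConfig`, the function `validate`, and every value except the documented defaults for k and keep_prob are placeholders chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class BanditConfig:
    """Hypothetical mirror of the settings listed above (values are placeholders)."""
    dataset: str = "MIND"            # 'MIND' or 'TaoBao'
    T: int = 1000                    # placeholder number of rounds
    k: int = 1                       # documented default
    activate_num: int = 10           # p for pHCB, placeholder
    activate_prob: float = 0.1       # q for pHCB, placeholder
    epsilon: float = 0.05            # placeholder
    new_tree_file: str = "tree.pkl"  # placeholder file name
    noise_scale: float = 0.01        # placeholder standard deviation
    keep_prob: float = 1.0           # documented default: test all users
    linucb_para: dict = field(default_factory=dict)  # keys not documented here
    ts_para: dict = field(default_factory=dict)      # keys not documented here
    poolsize: int = 50               # placeholder
    random_choice: bool = False      # placeholder


def validate(cfg):
    """Return a list of human-readable problems; an empty list means no problem found."""
    errors = []
    if cfg.dataset not in ("MIND", "TaoBao"):
        errors.append("dataset must be 'MIND' or 'TaoBao'")
    if cfg.T < 1:
        errors.append("T must be at least 1")
    if cfg.k < 1:
        errors.append("k must be at least 1")
    if not 0.0 <= cfg.epsilon <= 1.0:
        errors.append("epsilon must lie in [0, 1]")
    if cfg.noise_scale < 0.0:
        errors.append("noise_scale is a standard deviation and cannot be negative")
    if not 0.0 < cfg.keep_prob <= 1.0:
        errors.append("keep_prob must lie in (0, 1]")
    return errors
```

Calling `validate(BanditConfig(dataset="TaoBao"))` returns an empty list, while an out-of-range value such as `epsilon=2` yields a message describing the problem.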

Environment: Python 3.6 with Anaconda.

To run the bandit code based on LinUCB:

$ cd algs
$ python simulator_multi_process.py

To run the bandit code based on epsilon-Greedy:

$ cd algsE
$ python simulator_multi_process.py

To run the bandit code based on Thompson sampling:

$ cd algsTS
$ python simulator_multi_process.py

Owner

yu song
