How to reproduce experimental results of Stochastic bandits with groups of similar arms submitted paper ?
To reproduce all of the empirical results (and more) of the section 5 (Experiments) of the submitted paper, it is enough to run the regret_experiment.ipynb
notebook.
To reproduce all of the empirical results (and more) of the appendix D (Experiments) of the submitted paper, you may run the regret_experiment.ipynb
notebook as well as the dispatching_experiment.ipynb
notebook.
To run the jupyter notebooks, you may use the jupyter software (https://jupyter.org/) or use an online plateform such as google Colab (https://colab.research.google.com/)
The code has run on google Colab on Friday 4th, June without problem. The stability of the package that were used (numpy, matplotlib, itertools (built-in), datetime (built-in)) makes it very likely to run correctly before the next major updates of one of those packages.
Otherwise, the code has been tested locally on a machine with the following versions : python 3.9.5 numpy 1.20.3 matplotlib 3.4.1
The algorithms.py
file contains all the algorithms that were used in the experiments of this paper. All the sequential algorithms depends on SequentiAlg
, a class that is defined in the Forban
module.
In the Forban
module, a class for Bandit
configurations is also defined.