Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?

This repository contains the code for the paper "Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?" (arXiv:2507.11891).

Code for Simulation

To replicate the simulation results reported in the paper, run the command:

python simulation.py

The function plot_regret_vs_horizon runs the simulation behind Figure 3 of the paper, in which two algorithms are run jointly (greedy paired with either epsilon-greedy or UCB).
The function plot_2d runs the simulation in which a pair of algorithms is run jointly, and plots the expected bias and the probability of correct comparison for epsilon-greedy, UCB, and Thompson sampling (Figures 4, 5, and 6 of the paper, respectively).
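
To illustrate what "running two algorithms jointly" can mean, here is a minimal sketch that is not taken from simulation.py. It assumes Gaussian rewards with unit variance, a uniformly random assignment of each round to one of the algorithms, and a single pool of per-arm statistics that all algorithms read and update. The names simulate and compare are hypothetical, and the definitions of bias and correct comparison used here (the A/B estimate under sharing measured against the difference when each algorithm runs alone) are simplifying assumptions; the paper's definitions may differ. Greedy is treated as epsilon-greedy with epsilon = 0.

```python
import random


def simulate(means, horizon, epsilons, rng):
    """Run len(epsilons) epsilon-greedy algorithms on one shared data pool.

    Each round is assigned uniformly at random to one algorithm, which picks
    an arm from the shared per-arm statistics. Returns the average reward
    collected by each algorithm. A single-element `epsilons` runs it alone.
    """
    n_arms, n_algos = len(means), len(epsilons)
    counts = [0] * n_arms          # shared pull counts
    sums = [0.0] * n_arms          # shared reward sums
    reward_total = [0.0] * n_algos
    pulls_total = [0] * n_algos
    for _ in range(horizon):
        algo = rng.randrange(n_algos)
        if 0 in counts:
            arm = counts.index(0)  # pull every arm once before estimating
        elif rng.random() < epsilons[algo]:
            arm = rng.randrange(n_arms)  # explore
        else:
            arm = max(range(n_arms), key=lambda a: sums[a] / counts[a])
        reward = rng.gauss(means[arm], 1.0)  # assumed unit-variance Gaussian
        counts[arm] += 1
        sums[arm] += reward
        reward_total[algo] += reward
        pulls_total[algo] += 1
    return [r / max(p, 1) for r, p in zip(reward_total, pulls_total)]


def compare(means, horizon, epsilon, n_reps, seed=0):
    """Estimate bias and probability of correct comparison by Monte Carlo."""
    rng = random.Random(seed)
    # A/B estimates: greedy and epsilon-greedy share data in each replication.
    ab_estimates = []
    for _ in range(n_reps):
        greedy_avg, eps_avg = simulate(means, horizon, [0.0, epsilon], rng)
        ab_estimates.append(eps_avg - greedy_avg)
    # Assumed ground truth: each algorithm run alone, averaged over replications.
    alone_greedy = sum(simulate(means, horizon, [0.0], rng)[0]
                       for _ in range(n_reps)) / n_reps
    alone_eps = sum(simulate(means, horizon, [epsilon], rng)[0]
                    for _ in range(n_reps)) / n_reps
    true_diff = alone_eps - alone_greedy
    bias = sum(ab_estimates) / n_reps - true_diff
    # Fraction of replications whose A/B estimate has the same sign as true_diff.
    prob_correct = sum((d > 0) == (true_diff > 0) for d in ab_estimates) / n_reps
    return {"true_diff": true_diff, "bias": bias, "prob_correct": prob_correct}


if __name__ == "__main__":
    print(compare(means=[0.0, 0.5], horizon=500, epsilon=0.1, n_reps=200))
```

Because both algorithms read the same statistics, greedy benefits from the exploration done by epsilon-greedy, which is one way the A/B estimate under sharing can differ from the stand-alone comparison.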

Citation

@article{li2025sharing,
	author    = {Shuangning Li and Chonghuan Wang and Jingyan Wang},
	title     = {Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?},
	journal   = {arXiv preprint arXiv:2507.11891},
	year      = {2025},
}

Contact

If you have any questions or feedback about the code or the paper, please contact Jingyan Wang (jingyanw@ttic.edu).

