Are Multi-Armed Bandits Susceptible to Peeking?

Skip to the main content

Zagreb International Review of Economics & Business, Vol. 21 No. 1, 2018.

Conference paper

https://doi.org/10.2478/zireb-2018-0004

Are Multi-Armed Bandits Susceptible to Peeking?

Markus Loecher ; Berlin School of Economics and Law, Berlin, Germany

Full text: english pdf 650 Kb

page 95-104

downloads: 705

APA 6th Edition

Loecher, M. (2018). Are Multi-Armed Bandits Susceptible to Peeking?. Zagreb International Review of Economics & Business, 21 (1), 95-104. https://doi.org/10.2478/zireb-2018-0004

MLA 8th Edition

Loecher, Markus. "Are Multi-Armed Bandits Susceptible to Peeking?." Zagreb International Review of Economics & Business, vol. 21, no. 1, 2018, pp. 95-104. https://doi.org/10.2478/zireb-2018-0004. Accessed 22 Dec. 2024.

Chicago 17th Edition

Loecher, Markus. "Are Multi-Armed Bandits Susceptible to Peeking?." Zagreb International Review of Economics & Business 21, no. 1 (2018): 95-104. https://doi.org/10.2478/zireb-2018-0004

Harvard

Loecher, M. (2018). 'Are Multi-Armed Bandits Susceptible to Peeking?', Zagreb International Review of Economics & Business, 21(1), pp. 95-104. https://doi.org/10.2478/zireb-2018-0004

Vancouver

Loecher M. Are Multi-Armed Bandits Susceptible to Peeking?. Zagreb International Review of Economics & Business [Internet]. 2018 [cited 2024 December 22];21(1):95-104. https://doi.org/10.2478/zireb-2018-0004

IEEE

M. Loecher, "Are Multi-Armed Bandits Susceptible to Peeking?", Zagreb International Review of Economics & Business, vol.21, no. 1, pp. 95-104, 2018. [Online]. https://doi.org/10.2478/zireb-2018-0004

Abstract

A standard method to evaluate new features and changes to e.g. Web sites is A/B testing. A common pitfall in performing A/B testing is the habit of looking at a test while it's running, then stopping early. Due to the implicit multiple testing, the p-value is no longer trustworthy and usually too small. We investigate the claim that Bayesian methods, unlike frequentist tests, are immune to this "peeking" problem. We demonstrate that two regularly used measures, namely posterior probability and value remaining are severely affected by repeated testing. We further show a strong dependence on the prior probability of the parameters of interest.

Keywords

multiple comparisons; A/B testing; Bayesian decision theory

Hrčak ID:

200885

URI

https://hrcak.srce.hr/200885

Publication date:

31.5.2018.

Visits: 1.424 *