Straightforward Bayesian A/B testing with Dirichlet posteriors
Dustin Hayden, Thomas Armitage

TL;DR
This paper proposes a simplified Bayesian A/B testing method using Dirichlet posteriors, enabling scalable and robust analysis of multiple experiments without extensive domain-specific tuning.
Contribution
It introduces a Dirichlet-Categorical approximation for joint posteriors, reducing complexity and manual effort in Bayesian A/B testing workflows.
Findings
Performs well in simulations and real-world scenarios
Reduces need for expert-tuned models
Facilitates scalable analysis of multiple experiments
Abstract
Bayesian A/B testing investigates metric changes using the joint posterior distribution of two (or more) experimentally-derived datasets. The construction of said joint posterior is often a time-consuming process requiring specialized knowledge and domain expertise. In businesses that perform tens to hundreds of A/B tests per month it is important to have a robust analysis pipeline that can handle the variety of experiments performed on a modern web platform; requiring a domain expert to select appropriate prior and likelihood distributions for each experiment simply does not scale. In this work, we highlight a solution to this problem using a generalized approximation of the true joint posterior using a Dirichlet-Categorical model. While a manually-constructed, expert-tuned model for every dataset is preferable, the Dirichlet-Categorical approximation performs sufficiently well in both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods in Clinical Trials · Gene expression and cancer classification
