Straightforward Bayesian A/B testing with Dirichlet posteriors

Dustin Hayden; Thomas Armitage

arXiv:2508.08077·stat.ME·August 12, 2025

Straightforward Bayesian A/B testing with Dirichlet posteriors

Dustin Hayden, Thomas Armitage

PDF

Open Access

TL;DR

This paper proposes a simplified Bayesian A/B testing method using Dirichlet posteriors, enabling scalable and robust analysis of multiple experiments without extensive domain-specific tuning.

Contribution

It introduces a Dirichlet-Categorical approximation for joint posteriors, reducing complexity and manual effort in Bayesian A/B testing workflows.

Findings

01

Performs well in simulations and real-world scenarios

02

Reduces need for expert-tuned models

03

Facilitates scalable analysis of multiple experiments

Abstract

Bayesian A/B testing investigates metric changes using the joint posterior distribution of two (or more) experimentally-derived datasets. The construction of said joint posterior is often a time-consuming process requiring specialized knowledge and domain expertise. In businesses that perform tens to hundreds of A/B tests per month it is important to have a robust analysis pipeline that can handle the variety of experiments performed on a modern web platform; requiring a domain expert to select appropriate prior and likelihood distributions for each experiment simply does not scale. In this work, we highlight a solution to this problem using a generalized approximation of the true joint posterior using a Dirichlet-Categorical model. While a manually-constructed, expert-tuned model for every dataset is preferable, the Dirichlet-Categorical approximation performs sufficiently well in both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods in Clinical Trials · Gene expression and cancer classification