Analysis of distributional variation through multi-scale Beta-Binomial modeling
Li Ma, Jacopo Soriano

TL;DR
This paper introduces a multi-scale Bayesian hierarchical model called ANDOVA for comparing multiple data sets, effectively distinguishing true distributional differences from confounders by leveraging dependencies among tests.
Contribution
The paper presents a novel multi-scale Beta-Binomial based hierarchical model that accounts for dependencies among tests in multi-sample distribution comparison, improving accuracy.
Findings
Effective in identifying true distributional differences
Reduces false positives caused by confounders
Demonstrates strong performance in simulations and real data
Abstract
Many statistical analyses involve the comparison of multiple data sets collected under different conditions in order to identify the difference in the underlying distributions. A common challenge in multi-sample comparison is the presence of various confounders, or extraneous causes other than the conditions of interest that also contribute to the difference across the distributions. They result in false findings, i.e., identified differences that are not replicable in follow-up investigations. We consider an ANOVA approach to addressing this issue in multi-sample comparison---by collecting replicate data sets under each condition, thereby allowing the identification of the interesting distributional variation from the extraneous ones. We introduce a multi-scale Bayesian hierarchical model for the analysis of distributional variation (ANDOVA) under this design, based on a collection of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Algorithms and Data Compression · Gene expression and cancer classification
