Multiple Testing with Heterogeneous Multinomial Distributions
Joshua Habiger, David Watts, Michael Anderson

TL;DR
This paper introduces a new false discovery rate (FDR) method tailored for heterogeneous multinomial data, improving the identification of truly associated species in microbiome studies.
Contribution
The paper proposes a mixture multinomial-based FDR method that better detects significant effects in heterogeneous data compared to existing procedures.
Findings
More non-negligible effects are discovered with the new method.
Fewer negligible effects are identified, reducing false positives.
The method outperforms standard procedures on real microbiome data.
Abstract
False discovery rate (FDR) procedures provide misleading inference when testing multiple null hypotheses with heterogeneous multinomial data. For example, in the motivating study the goal is to identify species of bacteria near the roots of wheat plants (rhizobacteria) that are associated with productivity, but standard procedures discover the most abundant species even when the association is weak or negligible, and fail to discover strong associations when species are not abundant. Consequently, a list of abundant species is produced by the multiple testing procedure even though the goal was to provide a list of producitivity-associated species. This paper provides an FDR method based on a mixture of multinomial distributions and shows that it tends to discover more non-negligible effects and fewer negligible effects when the data are heterogeneous across tests. The proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Pesticide Residue Analysis and Safety · Meta-analysis and systematic reviews
