Power-Enhanced Two-Sample Mean Tests for High-Dimensional Compositional Data with Application to Microbiome Data Analysis
Danning Li, Lingzhou Xue, Haoyi Yang, Xiufan Yu

TL;DR
This paper introduces a new power-enhanced two-sample mean testing method for high-dimensional compositional data, effectively combining p-value based tests to improve detection power in microbiome studies.
Contribution
It develops a novel combination of maximum and quadratic p-value tests with theoretical guarantees, enhancing power and robustness in high-dimensional compositional data analysis.
Findings
Improved testing power over existing methods.
Accurate control of Type-I error rates.
Effective application to microbiome data.
Abstract
Testing differences in mean vectors is a fundamental task in the analysis of high-dimensional compositional data. Existing methods may suffer from low power if the underlying signal pattern is in a situation that does not favor the deployed test. In this work, we develop two-sample power-enhanced mean tests for high-dimensional compositional data based on the combination of -values, which integrates strengths from two popular types of tests: the maximum-type test and the quadratic-type test. We provide rigorous theoretical guarantees on the proposed tests, showing accurate Type-I error rate control and enhanced testing power. Our method boosts the testing power towards a broader alternative space, which yields robust performance across a wide range of signal pattern settings. Our theory also contributes to the literature on power enhancement and Gaussian approximation for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping · Mineral Processing and Grinding · Hydrocarbon exploration and reservoir analysis
