Where to find needles in a haystack?
Zhigen Zhao

TL;DR
This paper introduces CLAT, a new multiple testing method that uses cumulative distribution functions to improve signal detection, especially when p-value based methods are inadequate, demonstrating superior performance and scalability.
Contribution
The paper proposes CLAT, a novel method combining CDF and local fdr for multiple testing, with fast convergence and scalable computation, outperforming existing approaches.
Findings
CLAT outperforms existing methods in simulations and real data.
The data-driven CLAT converges quickly to the oracle version.
The algorithm is computationally efficient and scalable.
Abstract
In many existing methods in multiple comparison, one starts with either Fisher's p-values or the local fdr scores. The former one, with a usual definition as the tail probability exceeding the observed test statistic under the null distribution, fails to use the information from the alternative hypothesis and the targeted region of signals could be completely wrong especially when the likelihood ratio function is not monotone. The local fdr based approaches, usually relying on the density functions, are optimal oracally. However, the targeted region of the signals of the data-driven version is problematic because of the slow convergence of the non-parametric density estimation especially on the boundaries. In this paper, we propose a new method: Cdf and Local fdr Assisted multiple Testing method (CLAT), which is optimal for cases when the p-values based method are not. Additionally, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Optimal Experimental Design Methods · Statistical Methods and Inference
