Accurate and Efficient Estimation of Small P-values with the Cross-Entropy Method: Applications in Genomic Data Analysis
Yang Shi, Mengqiao Wang, Weiping Shi, Ji-Hyun Lee, Huining Kang and, Hui Jiang

TL;DR
This paper introduces a novel method combining the cross-entropy approach and MCMC sampling to accurately estimate extremely small p-values in genomic data analysis, overcoming limitations of existing methods.
Contribution
It presents a general, efficient algorithm for estimating tiny p-values for complex test statistics where analytical solutions are unavailable.
Findings
Accurately estimates p-values as small as 10^{-100}
Demonstrates effectiveness through simulations and real genomic data
Improves existing statistical testing procedures in genomics
Abstract
Small -values are often required to be accurately estimated in large-scale genomic studies for the adjustment of multiple hypothesis tests and the ranking of genomic features based on their statistical significance. For those complicated test statistics whose cumulative distribution functions are analytically intractable, existing methods usually do not work well with small -values due to lack of accuracy or computational restrictions. We propose a general approach for accurately and efficiently estimating small -values for a broad range of complicated test statistics based on the principle of the cross-entropy method and Markov chain Monte Carlo sampling techniques. We evaluate the performance of the proposed algorithm through simulations and demonstrate its application to three real-world examples in genomic studies. The results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Gene expression and cancer classification · Statistical Methods and Inference
