Multiple Testing in Genome-Wide Association Studies via Hierarchical Hidden Markov Models
Pengfei Wang, Zhaofeng Tian

TL;DR
This paper introduces a hierarchical hidden Markov model-based multiple testing procedure for genome-wide association studies that accounts for local correlations among tests, improving detection power and interpretability.
Contribution
It develops a novel hierarchical HMM approach for GWAS multiple testing that automatically segments chromosome regions and models local correlations, enhancing efficiency.
Findings
Outperforms existing methods in simulations
Valid and optimal theoretical properties established
Effective in real GWAS data analysis
Abstract
The problems of large-scale multiple testing are often encountered in modern scientific researches. Conventional multiple testing procedures usually suffer considerable loss of testing efficiency due to the lack of consideration of correlations among tests. In fact, the appropriate use of correlation information not only enhances the efficacy of multiple testing but also improves the interpretability of the results. Since the disease- or trait-related single nucleotide polymorphisms (SNPs) often tend to be clustered and exhibit serial correlations, the hidden Markov model (HMM) based multiple testing procedure has been successfully applied in genome-wide association studies (GWAS). It is important to note that modeling the entire chromosome using one HMM is somewhat rough. To overcome this issue, this paper employs the hierarchical hidden Markov model (HHMM) to describe local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genetic Associations and Epidemiology · Genomic variations and chromosomal abnormalities
