TL;DR
This paper introduces a Bayesian hierarchical model for genome- and epigenome-wide association studies that accounts for gene-level dependence, improving the detection of associations in high-throughput genetic data.
Contribution
It proposes a nonparametric Bayesian approach that models gene-specific association probabilities, enabling better multiplicity adjustment and integration into existing screening methods.
Findings
Applied to DNA methylation data from glioma tumors.
Demonstrated improved detection of gene-level associations.
Software implementation available in R package BayesianScreening.
Abstract
High-throughput genetic and epigenetic data are often screened for associations with an observed phenotype. For example, one may wish to test hundreds of thousands of genetic variants, or DNA methylation sites, for an association with disease status. These genomic variables can naturally be grouped by the gene they encode, among other criteria. However, standard practice in such applications is independent screening with a universal correction for multiplicity. We propose a Bayesian approach in which the prior probability of an association for a given genomic variable depends on its gene, and the gene-specific probabilities are modeled nonparametrically. This hierarchical model allows for appropriate gene and genome-wide multiplicity adjustments, and can be incorporated into a variety of Bayesian association screening methodologies with negligible increase in computational complexity.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
