A Statistical Model for the Analysis of Beta Values in DNA Methylation Studies
Leonie Weinhold, Simone Wahl, Matthias Schmid

TL;DR
This paper introduces a new statistical model based on a bivariate gamma distribution to analyze DNA methylation beta values, addressing limitations of traditional methods and improving association detection with covariates.
Contribution
A novel bivariate gamma-based model for beta value analysis that accounts for correlation between signal intensities, enhancing epigenome-wide association studies.
Findings
Model explicitly accounts for correlation between M and U
Improves detection of associations with covariates
Comparable ease of use to existing methods
Abstract
Background: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M = (M + U) that are generated by Illumina's 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. Results: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpigenetics and DNA Methylation · Gene expression and cancer classification · Genomics and Chromatin Dynamics
