TL;DR
This study introduces higher order methylation features derived from probabilistic machine learning to improve gene expression prediction and reveal methylation patterns at promoters, advancing understanding of methylation's regulatory role.
Contribution
The paper presents a novel method to extract and utilize higher order methylation features, enhancing prediction accuracy and uncovering methylation patterns beyond CpG islands.
Findings
Higher order features significantly improve gene expression prediction.
Five major methylation patterns identified at promoters.
Spatial methylation correlations relate to gene regulation.
Abstract
Motivation: DNA methylation is an intensely studied epigenetic mark, yet its functional role is incompletely understood. Attempts to quantitatively associate average DNA methylation to gene expression yield poor correlations outside of the well-understood methylation-switch at CpG islands. Results: Here we use probabilistic machine learning to extract higher order features associated with the methylation profile across a defined region. These features quantitate precisely notions of shape of a methylation profile, capturing spatial correlations in DNA methylation across genomic regions. Using these higher order features across promoter-proximal regions, we are able to construct a powerful machine learning predictor of gene expression, significantly improving upon the predictive power of average DNA methylation levels. Furthermore, we can use higher order features to cluster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
