A novel family of beta mixture models for the differential analysis of DNA methylation data: an application to prostate cancer
Koyel Majumdar, Romina Silva, Antoinette Sabrina Perry, Ronald William, Watson, Andrea Rau, Florence Jaffrezic, Thomas Brendan Murphy, Isobel, Claire Gormley

TL;DR
This paper introduces a new family of beta mixture models that analyze DNA methylation data directly without transformation, enabling biologically interpretable detection of differential methylation in prostate cancer studies.
Contribution
The paper presents a novel beta mixture modeling approach that infers methylation thresholds and identifies differentially methylated sites directly from untransformed data.
Findings
Successfully identified biologically relevant DMCs in prostate cancer data.
Demonstrated the model's effectiveness through simulation studies.
Provided an R package for easy application of the method.
Abstract
Identifying differentially methylated cytosine-guanine dinucleotide (CpG) sites between benign and tumour samples can assist in understanding disease. However, differential analysis of bounded DNA methylation data often requires data transformation, reducing biological interpretability. To address this, a family of beta mixture models (BMMs) is proposed that (i) objectively infers methylation state thresholds and (ii) identifies differentially methylated CpG sites (DMCs) given untransformed, beta-valued methylation data. The BMMs achieve this through model-based clustering of CpG sites and by employing parameter constraints, facilitating application to different study settings. Inference proceeds via an expectation-maximisation algorithm, with an approximate maximization step providing tractability and computational feasibility. Performance of the BMMs is assessed through thorough…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Algorithms and Data Compression
