Doubly Non-Central Beta Matrix Factorization for DNA Methylation Data
Aaron Schein, Anjali Nagulpally, Hanna Wallach, Patrick Flaherty

TL;DR
This paper introduces a novel non-negative matrix factorization model based on the doubly non-central beta distribution, specifically designed for modeling complex DNA methylation data with improved predictive accuracy and biologically meaningful latent features.
Contribution
The paper develops a new DNCB-based matrix factorization model for bounded data, with an efficient inference algorithm, enhancing DNA methylation analysis and general applicability.
Findings
Outperforms existing methods in DNA methylation prediction
Produces biologically meaningful latent representations
Demonstrates versatility for other bounded data domains
Abstract
We present a new non-negative matrix factorization model for bounded-support data based on the doubly non-central beta (DNCB) distribution, a generalization of the beta distribution. The expressiveness of the DNCB distribution is particularly useful for modeling DNA methylation datasets, which are typically highly dispersed and multi-modal; however, the model structure is sufficiently general that it can be adapted to many other domains where latent representations of bounded-support data are of interest. Although the DNCB distribution lacks a closed-form conjugate prior, several augmentations let us derive an efficient posterior inference algorithm composed entirely of analytic updates. Our model improves out-of-sample predictive performance on both real and synthetic DNA methylation datasets over state-of-the-art methods in bioinformatics. In addition, our model yields…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Chromatin Dynamics · Epigenetics and DNA Methylation
