PIntMF: Penalized Integrative Matrix Factorization Method for Multi-Omics Data
Morgane Pierre-Jean (CNRGH), Florence Mauger (CNRGH),, Jean-Fran\c{c}ois Deleuze (CNRGH), Edith Le Floch

TL;DR
PIntMF is a novel penalized matrix factorization method that effectively clusters and identifies relevant variables in multi-omics data, improving interpretability and outperforming existing methods in both synthetic and real datasets.
Contribution
Introduces PIntMF, a sparse, constrained matrix factorization model with automatic tuning, enhancing multi-omics data integration and interpretation.
Findings
Successfully identifies relevant clusters and variables in synthetic data.
Reveals interpretable clusters linked to clinical data in real datasets.
Outperforms existing methods in clustering and variable selection.
Abstract
It is more and more common to explore the genome at diverse levels and not only at a single omic level. Through integrative statistical methods, omics data have the power to reveal new biological processes, potential biomarkers, and subgroups of a cohort. The matrix factorization (MF) is a unsupervised statistical method that allows giving a clustering of individuals, but also revealing relevant omic variables from the various blocks. Here, we present PIntMF (Penalized Integrative Matrix Factorization), a model of MF with sparsity, positivity and equality constraints.To induce sparsity in the model, we use a classical Lasso penalization on variable and individual matrices. For the matrix of samples, sparsity helps for the clustering, and normalization (matching an equality constraint) of inferred coefficients is added for a better interpretation. Besides, we add an automatic tuning of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
