Bayesian Probit Multi-Study Non-negative Matrix Factorization for Mutational Signatures
Blake Hansen, Isabella N. Grabski, Giovanni Parmigiani, Roberta De, Vito

TL;DR
This paper introduces a Bayesian multi-study NMF method for analyzing mutational signatures across different cancer types, enabling more precise, interpretable, and tissue-specific insights into tumor mutational processes.
Contribution
The paper presents a novel Bayesian multi-study NMF model that incorporates sparsity and covariate dependence to improve mutational signature estimation across multiple cancer types.
Findings
Accurately estimates mutational signatures in diverse cancer datasets.
Identifies tissue-specific and individual mutational signature patterns.
Provides an interpretable framework for patient subtyping based on mutational profiles.
Abstract
Mutational signatures are patterns of somatic mutations in tumor genomes that provide insights into underlying mutagenic processes and cancer origin. Developing reliable methods for their estimation is of growing importance in cancer biology. Somatic mutation data are often collected for different cancer types, highlighting the need for multi-study approaches that enable joint analysis in a principled and integrative manner. Despite significant advancements, statistical models tailored for analyzing the genomes of multiple cancer types remain underexplored. In this work, we introduce a Bayesian Multi-Study Non-negative Matrix Factorization (NMF) approach that uses mixture modeling to incorporate sparsity in the exposure weights of each subject to mutational signatures, allowing for individual tumor profiles to be represented by a subset rather than all signatures, and making this subset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Chromatin Dynamics · Machine Learning in Bioinformatics
