TL;DR
This paper introduces a multivariate sparse fusion method to analyze heterogeneity in genetic regulations across subgroups, guided by biomarkers, addressing high-dimensional challenges in complex disease studies.
Contribution
The paper proposes a novel penalized fusion approach for identifying subgroup structures and regulation relationships in high-dimensional genetic data, guided by known biomarkers.
Findings
Identified distinct genetic regulation patterns in melanoma and stomach cancer.
Demonstrated the effectiveness of the method through extensive simulations.
Applied the approach to TCGA data revealing biologically relevant heterogeneity.
Abstract
Heterogeneity is a hallmark of many complex diseases. There are multiple ways of defining heterogeneity, among which the heterogeneity in genetic regulations, for example GEs (gene expressions) by CNVs (copy number variations) and methylation, has been suggested but little investigated. Heterogeneity in genetic regulations can be linked with disease severity, progression, and other traits and is biologically important. However, the analysis can be very challenging with the high dimensionality of both sides of regulation as well as sparse and weak signals. In this article, we consider the scenario where subjects form unknown subgroups, and each subgroup has unique genetic regulation relationships. Further, such heterogeneity is "guided" by a known biomarker. We develop an MSF (Multivariate Sparse Fusion) approach, which innovatively applies the penalized fusion technique to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
