Integrative decomposition of multi-source data by identifying partially-joint score subspaces
SeoWon Gabriel Choi, Sungkyu Jung

TL;DR
This paper introduces a novel, fast algorithm for decomposing multi-source data to identify shared and source-specific factors, demonstrated on multi-omics biological data to uncover hidden structures.
Contribution
The paper proposes a new integrative decomposition framework that utilizes source-wise factor score subspaces to identify partially-joint associations, improving accuracy and speed over existing methods.
Findings
Successfully identified a partially-joint latent score in multi-omics data
Outperformed competing methods in structure recovery accuracy
Applied to blood cancer data to discover hidden clusters
Abstract
Analysis of multi-source dataset, where data on the same objects are collected from multiple sources, is of rising importance in many fields, most notably in multi-omics biology. A novel framework and algorithms for integrative decomposition of such multi-source data are proposed to identify and sort out common factor scores in terms of whether the scores are relevant to all data sources (fully joint), to some data sources (partially joint), or to a single data source. The key difference between the proposed method and existing approaches is that raw source-wise factor score subspaces are utilized in the identification of the partially-joint block-wise association structure. To identify common score subspaces, which may be partially joint to some of data sources, from noisy observations, the proposed algorithm sequentially computes one-dimensional flag means among source-wise score…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genetic Associations and Epidemiology
