Incorporating Covariates into Integrated Factor Analysis of Multi-View Data
Gen Li, Sungkyu Jung

TL;DR
This paper introduces a supervised integrated factor analysis model that incorporates covariates for multi-view data, enabling better dimension reduction and interpretation in biomedical research.
Contribution
The paper develops a novel statistical model, SIFA, that decomposes multi-view data into joint and individual factors while integrating auxiliary covariates, with an efficient EM algorithm for fitting.
Findings
Applied to GTEx data, revealing new insights into gene expression variation.
Simulation studies show SIFA outperforms existing methods.
Demonstrated effectiveness in a pediatric growth study.
Abstract
In modern biomedical research, it is ubiquitous to have multiple data sets measured on the same set of samples from different views (i.e., multi-view data). For example, in genetic studies, multiple genomic data sets at different molecular levels or from different cell types are measured for a common set of individuals to investigate genetic regulation. Integration and reduction of multi-view data have the potential to leverage information in different data sets, and to reduce the magnitude and complexity of data for further statistical analysis and interpretation. In this paper, we develop a novel statistical model, called supervised integrated factor analysis (SIFA), for integrative dimension reduction of multi-view data while incorporating auxiliary covariates. The model decomposes data into joint and individual factors, capturing the joint variation across multiple data sets and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genetic and phenotypic traits in livestock · Statistical Methods and Inference
