Generalized Integrative Principal Component Analysis for Multi-Type Data with Block-Wise Missing Structure
Huichen Zhu, Gen Li, Eric F. Lock

TL;DR
This paper introduces GIPCA, a low-rank method for dimension reduction and imputation of multi-source data with heterogeneous types and block-wise missingness, improving analysis accuracy in complex datasets.
Contribution
The paper proposes GIPCA, a novel low-rank approach that handles multiple data types and block-wise missing data for integrative analysis and imputation.
Findings
GIPCA accurately estimates ranks and recovers signals in simulations.
It effectively imputes block-wise missing data in real-world applications.
GIPCA reveals meaningful latent patterns in mortality data.
Abstract
High-dimensional multi-source data are encountered in many fields. Despite recent developments on the integrative dimension reduction of such data, most existing methods cannot easily accommodate data of multiple types (e.g., binary or count-valued). Moreover, multi-source data often have block-wise missing structure, i.e., data in one or more sources may be completely unobserved for a sample. The heterogeneous data types and presence of block-wise missing data pose significant challenges to the integration of multi-source data and further statistical analyses. In this paper, we develop a low-rank method, called Generalized Integrative Principal Component Analysis (GIPCA), for the simultaneous dimension reduction and imputation of multi-source block-wise missing data, where different sources may have different data types. We also devise an adapted BIC criterion for rank estimation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Advanced Neuroimaging Techniques and Applications
