High-Dimensional Multi-Study Multi-Modality Covariate-Augmented Generalized Factor Model
Wei Liu, Qingzhi Zhong

TL;DR
This paper introduces a high-dimensional generalized factor model that integrates multi-modality data across multiple studies with covariates, addressing limitations of existing methods and providing a scalable, interpretable solution with proven theoretical properties.
Contribution
It develops a novel high-dimensional multi-study multi-modality covariate-augmented factor model with identifiability analysis, variational inference, and an efficient EM algorithm, advancing multi-source data integration.
Findings
Outperforms existing methods in accuracy and efficiency
Provides theoretical guarantees for estimators
Demonstrates effectiveness on real-world data
Abstract
Latent factor models that integrate data from multiple sources/studies or modalities have garnered considerable attention across various disciplines. However, existing methods predominantly focus either on multi-study integration or multi-modality integration, rendering them insufficient for analyzing the diverse modalities measured across multiple studies. To address this limitation and cater to practical needs, we introduce a high-dimensional generalized factor model that seamlessly integrates multi-modality data from multiple studies, while also accommodating additional covariates. We conduct a thorough investigation of the identifiability conditions to enhance the model's interpretability. To tackle the complexity of high-dimensional nonlinear integration caused by four large latent random matrices, we utilize a variational lower bound to approximate the observed log-likelihood by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTechnology and Data Analysis
