RaJIVE: Robust Angle Based JIVE for Integrating Noisy Multi-Source Data
Erica Ponzi, Magne Thoresen, Abhik Ghosh

TL;DR
RaJIVE introduces a robust extension to the aJIVE method for multi-source data integration, effectively handling outliers and noise to accurately identify joint and individual data patterns.
Contribution
The paper develops RaJIVE, a robust version of aJIVE that incorporates robust SVD, improving stability and accuracy in the presence of data contamination.
Findings
RaJIVE outperforms aJIVE in contaminated data scenarios.
Simulation studies demonstrate improved decomposition accuracy.
Application to breast cancer data shows practical utility.
Abstract
With increasing availability of high dimensional, multi-source data, the identification of joint and data specific patterns of variability has become a subject of interest in many research areas. Several matrix decomposition methods have been formulated for this purpose, for example JIVE (Joint and Individual Variation Explained), and its angle based variation, aJIVE. Although the effect of data contamination on the estimated joint and individual components has not been considered in the literature, gross errors and outliers in the data can cause instability in such methods, and lead to incorrect estimation of joint and individual variance components. We focus on the aJIVE factorization method and provide a thorough analysis of the effect outliers on the resulting variation decomposition. After showing that such effect is not negligible when all data-sources are contaminated, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Gene expression and cancer classification · Genetic Mapping and Diversity in Plants and Animals
