CoCA: Cooperative Component Analysis
Daisy Yi Ding, Alden Green, Min Woo Sun, Robert Tibshirani

TL;DR
CoCA is a novel unsupervised multi-view analysis method that balances within-view variance and cross-view correlation, effectively integrating multiomics data to uncover shared biological signals.
Contribution
It introduces a flexible framework interpolating between PCA and CCA, with a data-adaptive agreement parameter and a sparse variant for feature selection.
Findings
Successfully applied to multiomics COVID-19 data, revealing predictive shared signals.
Effectively integrated multiomics data in breast cancer study, uncovering informative components.
Demonstrated superior performance on simulated data and real-world applications.
Abstract
We propose Cooperative Component Analysis (CoCA), a new method for unsupervised multi-view analysis: it identifies the component that simultaneously captures significant within-view variance and exhibits strong cross-view correlation. The challenge of integrating multi-view data is particularly important in biology and medicine, where various types of "-omic" data, ranging from genomics to proteomics, are measured on the same set of samples. The goal is to uncover important, shared signals that represent underlying biological mechanisms. CoCA combines an approximation error loss to preserve information within data views and an "agreement penalty" to encourage alignment across data views. By balancing the trade-off between these two key components in the objective, CoCA has the property of interpolating between the commonly-used principal component analysis (PCA) and canonical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
