Resistant Multiple Sparse Canonical Correlation
Jacob Coleman, Joseph Replogle, Gabriel Chandler, Johanna Hardin

TL;DR
This paper introduces a resistant estimation method for Sparse Canonical Correlation Analysis (SCCA) that improves variable selection and the discovery of multiple canonical pairs, especially in high-dimensional biological data.
Contribution
It develops a resistant estimation approach for SCCA, enhancing robustness and accuracy in identifying correlated variables and multiple canonical pairs.
Findings
Resistant estimators outperform standard ones in variable selection.
The method successfully identifies multiple canonical pairs.
Application to biological data demonstrates improved interpretability.
Abstract
Canonical Correlation Analysis (CCA) is a multivariate technique that takes two datasets and forms the most highly correlated possible pairs of linear combinations between them. Each subsequent pair of linear combinations is orthogonal to the preceding pair, meaning that new information is gleaned from each pair. By looking at the magnitude of coefficient values, we can find out which variables can be grouped together, thus better understanding multiple interactions that are otherwise difficult to compute or grasp intuitively. CCA appears to have quite powerful applications to high throughput data, as we can use it to discover, for example, relationships between gene expression and gene copy number variation. One of the biggest problems of CCA is that the number of variables (often upwards of 10,000) makes biological interpretation of linear combinations nearly impossible. To limit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genetic Mapping and Diversity in Plants and Animals · Bioinformatics and Genomic Networks
