Unconstrained Stochastic CCA: Unifying Multiview and Self-Supervised Learning
James Chapman, Lennie Wells, Ana Lawry Aguila

TL;DR
This paper introduces fast stochastic algorithms for CCA and PLS that outperform previous methods in speed and accuracy, enabling large-scale biomedical data analysis and enhancing self-supervised learning performance.
Contribution
The paper proposes a novel unconstrained objective and stochastic gradient algorithms for CCA, PLS, and Deep CCA, unifying multiview and self-supervised learning with improved efficiency.
Findings
Faster convergence and higher correlation recovery than previous methods.
Successful large-scale biomedical data analysis with over 33,000 individuals.
Matching SSL methods' performance on CIFAR datasets with minimal tuning.
Abstract
The Canonical Correlation Analysis (CCA) family of methods is foundational in multiview learning. Regularised linear CCA methods can be seen to generalise Partial Least Squares (PLS) and be unified with a Generalized Eigenvalue Problem (GEP) framework. However, classical algorithms for these linear methods are computationally infeasible for large-scale data. Extensions to Deep CCA show great promise, but current training procedures are slow and complicated. First we propose a novel unconstrained objective that characterizes the top subspace of GEPs. Our core contribution is a family of fast algorithms for stochastic PLS, stochastic CCA, and Deep CCA, simply obtained by applying stochastic gradient descent (SGD) to the corresponding CCA objectives. Our algorithms show far faster convergence and recover higher correlations than the previous state-of-the-art on all standard CCA and Deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStatistical Methods and Inference · Face and Expression Recognition · Gene expression and cancer classification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
