A simple and provable algorithm for sparse diagonal CCA
Megasthenis Asteris, Anastasios Kyrillidis, Oluwasanmi Koyejo, Russell, Poldrack

TL;DR
This paper introduces a simple, efficient, and provably accurate algorithm for sparse diagonal CCA, enabling precise control over sparsity and scalable computation, with applications demonstrated in neuroimaging data analysis.
Contribution
The paper presents a novel combinatorial algorithm for sparse diagonal CCA with theoretical guarantees and linear complexity, improving over existing methods.
Findings
Algorithm achieves linear scalability with input variables.
Provides data-dependent approximation guarantees.
Successfully applied to neuroimaging data analysis.
Abstract
Given two sets of variables, derived from a common set of samples, sparse Canonical Correlation Analysis (CCA) seeks linear combinations of a small number of variables in each set, such that the induced canonical variables are maximally correlated. Sparse CCA is NP-hard. We propose a novel combinatorial algorithm for sparse diagonal CCA, i.e., sparse CCA under the additional assumption that variables within each set are standardized and uncorrelated. Our algorithm operates on a low rank approximation of the input data and its computational complexity scales linearly with the number of input variables. It is simple to implement, and parallelizable. In contrast to most existing approaches, our algorithm administers precise control on the sparsity of the extracted canonical vectors, and comes with theoretical data-dependent global approximation guarantees, that hinge on the spectrum of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Functional Brain Connectivity Studies · Bayesian Methods and Mixture Models
