A greedy approach to sparse canonical correlation analysis
Ami Wiesel, Mark Kliger, Alfred O. Hero III

TL;DR
This paper introduces a greedy algorithm for sparse canonical correlation analysis that efficiently finds highly correlated linear combinations with limited variables, suitable for large datasets and small sample sizes.
Contribution
It presents a novel greedy approximation method for sparse CCA that is computationally efficient and effective for high-dimensional data analysis.
Findings
Significant correlation can be captured with few variables.
The method is computationally scalable for large datasets.
Sparse CCA acts as a regularizer in small sample scenarios.
Abstract
We consider the problem of sparse canonical correlation analysis (CCA), i.e., the search for two linear combinations, one for each multivariate, that yield maximum correlation using a specified number of variables. We propose an efficient numerical approximation based on a direct greedy approach which bounds the correlation at each stage. The method is specifically designed to cope with large data sets and its computational complexity depends only on the sparsity levels. We analyze the algorithm's performance through the tradeoff between correlation and parsimony. The results of numerical simulation suggest that a significant portion of the correlation may be captured using a relatively small number of variables. In addition, we examine the use of sparse CCA as a regularization method when the number of available samples is small compared to the dimensions of the multivariates.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Face and Expression Recognition · Spectroscopy and Chemometric Analyses
