Comparison of Canonical Correlation and Partial Least Squares analyses of simulated and empirical data
Anthony R McIntosh

TL;DR
This study compares Canonical Correlation Analysis (CCA) and Partial Least Squares (PLS) on simulated and real datasets, evaluating their sensitivity, reliability, and reproducibility across varying sample sizes and data structures.
Contribution
It provides a comprehensive comparison of CCA and PLS performance, highlighting conditions where each method is reliable and proposing PCA as a remedy for high within-block correlations.
Findings
CCA and PLS show similar performance on data with moderate correlations.
High within-block correlations can reduce CCA's reproducibility, but PCA can mitigate this.
Null hypothesis testing does not ensure reproducibility, even with large samples.
Abstract
In this paper, we compared the general forms of CCA and PLS on three simulated and two empirical datasets, all having large sample sizes. We took successively smaller subsamples of these data to evaluate sensitivity, reliability, and reproducibility. In null data having no correlation within or between blocks, both methods showed equivalent false positive rates across sample sizes. Both methods also showed equivalent detection in data with weak but reliable effects until sample sizes drop below n=50. In the case of strong effects, both methods showed similar performance unless the correlations of items within one data block were high. For PLS, the results were reproducible across sample sizes for strong effects, except at the smallest sample sizes. On the contrary, the reproducibility for CCA declined when the within-block correlations were high. This was ameliorated if a principal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSensory Analysis and Statistical Methods · Spectroscopy and Chemometric Analyses · Advanced Statistical Methods and Models
