On the approximation of the between-set correlation matrix by canonical correlation analysis
Jan Graffelman

TL;DR
This paper improves the approximation of the between-set correlation matrix in canonical correlation analysis by proposing an adjustment method and an efficient algorithm, enhancing visualization and interpretability.
Contribution
It introduces an adjustment technique for the correlation matrix using scalar and effect-based modifications, along with an alternating least squares algorithm for optimal low-rank approximation.
Findings
Adjusted analysis achieves lower root mean squared error compared to classic CCA.
Enhanced biplot visualization with minimal interpretative changes.
Software implementation available in R for practical application.
Abstract
Canonical correlation analysis is a classic well-known multivariate statistical method focusing on the relationships between two sets of variables. The visualisation of those relationships can be achieved by means of a biplot of the between-set correlation matrix. The canonical analysis provides a low-rank approximation to the between-set correlation matrix that is optimal in a generalised least squares sense. This article proposes to adjust the between-set correlation matrix using either a single scalar effect, or column and/or row effects. An alternating generalised least squares algorithm is proposed to obtain optimal adjustments and low-rank factorisations. The adjustment leads to a better approximation of the between-set correlation matrix that achieves a lower root mean squared error in comparison with the classic canonical analysis. The results of the adjusted analysis can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
