Dimension reduction based on constrained canonical correlation and variable filtering
Jianhui Zhou, Xuming He

TL;DR
This paper introduces a constrained canonical correlation ($C^3$) method combined with variable filtering to improve interpretability and maintain predictive power in high-dimensional data analysis, outperforming existing methods like SSIR.
Contribution
The paper proposes a novel $C^3$ method that constrains canonical correlation estimates and incorporates variable filtering for better interpretability without losing accuracy.
Findings
$C^3$ outperforms SSIR in simulation studies.
The method produces more interpretable composite directions.
Applications demonstrate practical usefulness.
Abstract
The ``curse of dimensionality'' has remained a challenge for high-dimensional data analysis in statistics. The sliced inverse regression (SIR) and canonical correlation (CANCOR) methods aim to reduce the dimensionality of data by replacing the explanatory variables with a small number of composite directions without losing much information. However, the estimated composite directions generally involve all of the variables, making their interpretation difficult. To simplify the direction estimates, Ni, Cook and Tsai [Biometrika 92 (2005) 242--247] proposed the shrinkage sliced inverse regression (SSIR) based on SIR. In this paper, we propose the constrained canonical correlation () method based on CANCOR, followed by a simple variable filtering method. As a result, each composite direction consists of a subset of the variables for interpretability as well as predictive power. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
