TL;DR
This paper introduces RFCCA, a novel random forest-based method for estimating how the relationship between two variable sets depends on covariates, with applications demonstrated on EEG data.
Contribution
The paper presents RFCCA, a new approach that uses random forests to estimate conditional canonical correlations influenced by covariates, including a significance test for covariate effects.
Findings
RFCCA accurately estimates conditional canonical correlations.
The method controls Type-1 error effectively.
Application to EEG data demonstrates practical utility.
Abstract
Investigating the relationships between two sets of variables helps to understand their interactions and can be done with canonical correlation analysis (CCA). However, the correlation between the two sets can sometimes depend on a third set of covariates, often subject-related ones such as age, gender, or other clinical measures. In this case, applying CCA to the whole population is not optimal and methods to estimate conditional CCA, given the covariates, can be useful. We propose a new method called Random Forest with Canonical Correlation Analysis (RFCCA) to estimate the conditional canonical correlations between two sets of variables given subject-related covariates. The individual trees in the forest are built with a splitting rule specifically designed to partition the data to maximize the canonical correlation heterogeneity between child nodes. We also propose a significance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
