TL;DR
This paper introduces multivariate tree boosting, a new method extending gradient boosted trees to handle multiple outcomes and many predictors, improving interpretability and predictive accuracy in large psychological datasets.
Contribution
The paper presents a novel multivariate extension of gradient boosted trees, with an R package 'mvtboost' for analyzing complex multivariate data in psychology.
Findings
Effectively identifies important predictors and non-linear effects.
Achieves high prediction accuracy, outperforming traditional methods.
Detects predictors influencing multiple outcomes without parametric assumptions.
Abstract
Technology and collaboration enable dramatic increases in the size of psychological and psychiatric data collections, but finding structure in these large data sets with many collected variables is challenging. Decision tree ensembles like random forests (Strobl, Malley, and Tutz, 2009) are a useful tool for finding structure, but are difficult to interpret with multiple outcome variables which are often of interest in psychology. To find and interpret structure in data sets with multiple outcomes and many predictors (possibly exceeding the sample size), we introduce a multivariate extension to a decision tree ensemble method called Gradient Boosted Regression Trees (Friedman, 2001). Our method, multivariate tree boosting, can be used for identifying important predictors, detecting predictors with non-linear effects and interactions without specification of such effects, and for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
