Multi forests: Variable importance for multi-class outcomes
Roman Hornung (1, 2), Alexander Hapfelmeier (3) ((1) Institute for, Medical Information Processing, Biometry, Epidemiology, LMU Munich,, Munich, Germany, (2) Munich Center for Machine Learning (MCML), Munich,, Germany, (3) Institute of AI, Informatics in Medicine, TUM School of

TL;DR
This paper introduces multi forests and a novel variable importance measure tailored for multi-class outcomes, effectively identifying covariates associated with specific classes, which traditional methods struggle to distinguish.
Contribution
The paper presents multi forests, a new RF variant with multi-way splits, and a multi-class VIM that accurately identifies class-specific covariates, advancing variable importance analysis in multi-class prediction.
Findings
Multi-class VIM effectively ranks class-associated covariates.
MuFs slightly underperform compared to conventional RFs in prediction.
Simulation and real data analyses validate the method's ability to identify relevant covariates.
Abstract
In prediction tasks with multi-class outcomes, identifying covariates specifically associated with one or more outcome classes can be important. Conventional variable importance measures (VIMs) from random forests (RFs), like permutation and Gini importance, focus on overall predictive performance or node purity, without differentiating between the classes. Therefore, they can be expected to fail to distinguish class-associated covariates from covariates that only distinguish between groups of classes. We introduce a VIM called multi-class VIM, tailored for identifying exclusively class-associated covariates, via a novel RF variant called multi forests (MuFs). The trees in MuFs use both multi-way and binary splitting. The multi-way splits generate child nodes for each class, using a split criterion that evaluates how well these nodes represent their respective classes. This setup forms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForest ecology and management · Forest Management and Policy
MethodsFocus
