A Random Forest Approach for Modeling Bounded Outcomes
Leonie Weinhold, Matthias Schmid, Marvin N. Wright, Moritz, Berger

TL;DR
This paper introduces a novel random forest method tailored for modeling bounded outcomes within the unit interval, leveraging the beta distribution's likelihood for improved heteroscedasticity handling.
Contribution
The paper proposes a new random forest approach that explicitly incorporates the beta distribution's likelihood for better modeling of bounded outcomes.
Findings
Outperforms classical random forests on bounded data
Effectively captures heteroscedasticity in the data
Demonstrates advantages over parametric regression models
Abstract
Random forests have become an established tool for classification and regression, in particular in high-dimensional settings and in the presence of complex predictor-response relationships. For bounded outcome variables restricted to the unit interval, however, classical random forest approaches may severely suffer as they do not account for the heteroscedasticity in the data. A random forest approach is proposed for relating beta distributed outcomes to explanatory variables. The approach explicitly makes use of the likelihood function of the beta distribution for the selection of splits during the tree-building procedure. In each iteration of the tree-building algorithm one chooses the combination of explanatory variable and splitting rule that maximizes the log-likelihood function of the beta distribution with the parameter estimates derived from the nodes of the currently built…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference
