A Random Forest Approach for Modeling Bounded Outcomes

Leonie Weinhold; Matthias Schmid; Marvin N. Wright; Moritz; Berger

arXiv:1901.06211·stat.ME·January 21, 2019·1 cites

A Random Forest Approach for Modeling Bounded Outcomes

Leonie Weinhold, Matthias Schmid, Marvin N. Wright, Moritz, Berger

PDF

Open Access

TL;DR

This paper introduces a novel random forest method tailored for modeling bounded outcomes within the unit interval, leveraging the beta distribution's likelihood for improved heteroscedasticity handling.

Contribution

The paper proposes a new random forest approach that explicitly incorporates the beta distribution's likelihood for better modeling of bounded outcomes.

Findings

01

Outperforms classical random forests on bounded data

02

Effectively captures heteroscedasticity in the data

03

Demonstrates advantages over parametric regression models

Abstract

Random forests have become an established tool for classification and regression, in particular in high-dimensional settings and in the presence of complex predictor-response relationships. For bounded outcome variables restricted to the unit interval, however, classical random forest approaches may severely suffer as they do not account for the heteroscedasticity in the data. A random forest approach is proposed for relating beta distributed outcomes to explanatory variables. The approach explicitly makes use of the likelihood function of the beta distribution for the selection of splits during the tree-building procedure. In each iteration of the tree-building algorithm one chooses the combination of explanatory variable and splitting rule that maximizes the log-likelihood function of the beta distribution with the parameter estimates derived from the nodes of the currently built…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference