# Consistent Estimation of Residual Variance with Random Forest Out-Of-Bag   Errors

**Authors:** Burim Ramosaj, Markus Pauly

arXiv: 1812.06270 · 2018-12-18

## TL;DR

This paper develops and proves the consistency of residual variance estimators based on out-of-bag errors in random forests, addressing a key gap in regression uncertainty quantification.

## Contribution

It introduces a novel residual variance estimator for random forests and establishes its theoretical consistency under L2-regularity assumptions.

## Key findings

- The estimator is consistent for residual variance in random forests.
- Provides theoretical foundation for uncertainty quantification in random forest regression.
- Addresses a previously underexplored aspect of residual variance estimation in machine learning.

## Abstract

The issue of estimating residual variance in regression models has experienced relatively little attention in the machine learning community. However, the estimate is of primary interest in many practical applications, e.g. as a primary step towards the construction of prediction intervals. Here, we consider this issue for the random forest. Therein, the functional relationship between covariates and response variable is modeled by a weighted sum of the latter. The dependence structure is, however, involved in the weights that are constructed during the tree construction process making the model complex in mathematical analysis. Restricting to L2-consistent random forest models, we provide random forest based residual variance estimators and prove their consistency.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.06270/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/1812.06270/full.md

---
Source: https://tomesphere.com/paper/1812.06270