Narrowing the Gap: Random Forests In Theory and In Practice

Misha Denil; David Matheson; Nando de Freitas

arXiv:1310.1415·stat.ML·October 8, 2013·175 cites

Narrowing the Gap: Random Forests In Theory and In Practice

Misha Denil, David Matheson, Nando de Freitas

PDF

Open Access

TL;DR

This paper advances understanding of random forests by introducing a new theoretically tractable variant, proving its consistency, and empirically comparing it to practical models to analyze their differences.

Contribution

It presents a new consistent, theoretically tractable random regression forest model and compares it empirically with existing practical algorithms.

Findings

01

The new model is proven to be consistent.

02

Empirical results highlight differences between theoretical and practical models.

03

Insights into simplifications used in theoretical models.

Abstract

Despite widespread interest and practical use, the theoretical properties of random forests are still not well understood. In this paper we contribute to this understanding in two ways. We present a new theoretically tractable variant of random regression forests and prove that our algorithm is consistent. We also provide an empirical evaluation, comparing our algorithm and other theoretically tractable random forest models to the random forest algorithm used in practice. Our experiments provide insight into the relative importance of different simplifications that theoreticians have made to obtain tractable models for analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Data Management and Algorithms · Machine Learning and Data Classification