Best-scored Random Forest Classification

Hanyuan Hang; Xiaoyu Liu; and Ingo Steinwart

arXiv:1905.11028·stat.ML·May 28, 2019·1 cites

Best-scored Random Forest Classification

Hanyuan Hang, Xiaoyu Liu, and Ingo Steinwart

PDF

Open Access

TL;DR

This paper introduces a best-scored random forest algorithm that selects the best-performing trees from random candidates, achieving higher accuracy and theoretical optimality in binary classification tasks.

Contribution

It proposes a novel best-scored selection method for random forests, with theoretical convergence guarantees and practical efficiency improvements.

Findings

01

Achieves higher accuracy than traditional random forests

02

Establishes almost optimal convergence rates under certain conditions

03

Demonstrates superior performance in numerical experiments

Abstract

We propose an algorithm named best-scored random forest for binary classification problems. The terminology "best-scored" means to select the one with the best empirical performance out of a certain number of purely random tree candidates as each single tree in the forest. In this way, the resulting forest can be more accurate than the original purely random forest. From the theoretical perspective, within the framework of regularized empirical risk minimization penalized on the number of splits, we establish almost optimal convergence rates for the proposed best-scored random trees under certain conditions which can be extended to the best-scored random forest. In addition, we present a counterexample to illustrate that in order to ensure the consistency of the forest, every dimension must have the chance to be split. In the numerical experiments, for the sake of efficiency, we employ…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Face and Expression Recognition · Sparse and Compressive Sensing Techniques