Binary Classification: Is Boosting stronger than Bagging?

Dimitris Bertsimas; Vasiliki Stoumpou

arXiv:2410.19200·cs.LG·October 28, 2024

Binary Classification: Is Boosting stronger than Bagging?

Dimitris Bertsimas, Vasiliki Stoumpou

PDF

Open Access

TL;DR

This paper introduces Enhanced Random Forests with adaptive weighting, significantly improving binary classification performance and interpretability, and demonstrating that bagging can match boosting in effectiveness with added transparency.

Contribution

The authors propose an adaptive weighting extension to Random Forests that enhances performance and interpretability, showing bagging can be as strong as boosting in binary classification.

Findings

01

Enhanced Random Forests outperform standard RFs on 15 datasets

02

The method matches or exceeds XGBoost performance with default parameters

03

Tree importance scores enable partial interpretability of ensemble decisions

Abstract

Random Forests have been one of the most popular bagging methods in the past few decades, especially due to their success at handling tabular datasets. They have been extensively studied and compared to boosting models, like XGBoost, which are generally considered more performant. Random Forests adopt several simplistic assumptions, such that all samples and all trees that form the forest are equally important for building the final model. We introduce Enhanced Random Forests, an extension of vanilla Random Forests with extra functionalities and adaptive sample and model weighting. We develop an iterative algorithm for adapting the training sample weights, by favoring the hardest examples, and an approach for finding personalized tree weighting schemes for each new sample. Our method significantly improves upon regular Random Forests across 15 different binary classification datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare

MethodsFocus