Binary Classification: Is Boosting stronger than Bagging?
Dimitris Bertsimas, Vasiliki Stoumpou

TL;DR
This paper introduces Enhanced Random Forests with adaptive weighting, significantly improving binary classification performance and interpretability, and demonstrating that bagging can match boosting in effectiveness with added transparency.
Contribution
The authors propose an adaptive weighting extension to Random Forests that enhances performance and interpretability, showing bagging can be as strong as boosting in binary classification.
Findings
Enhanced Random Forests outperform standard RFs on 15 datasets
The method matches or exceeds XGBoost performance with default parameters
Tree importance scores enable partial interpretability of ensemble decisions
Abstract
Random Forests have been one of the most popular bagging methods in the past few decades, especially due to their success at handling tabular datasets. They have been extensively studied and compared to boosting models, like XGBoost, which are generally considered more performant. Random Forests adopt several simplistic assumptions, such that all samples and all trees that form the forest are equally important for building the final model. We introduce Enhanced Random Forests, an extension of vanilla Random Forests with extra functionalities and adaptive sample and model weighting. We develop an iterative algorithm for adapting the training sample weights, by favoring the hardest examples, and an approach for finding personalized tree weighting schemes for each new sample. Our method significantly improves upon regular Random Forests across 15 different binary classification datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
MethodsFocus
