There is no Double-Descent in Random Forests
Sebastian Buschj\"ager, Katharina Morik

TL;DR
This paper challenges the double-descent phenomenon in Random Forests, showing they have a single descent and that algorithmic factors, not model capacity, explain their success, with implications for ensemble diversity and overfitting.
Contribution
It demonstrates that Random Forests do not exhibit double-descent, emphasizes the importance of training algorithms over model capacity, and introduces a method to control ensemble diversity.
Findings
RFs do not show double-descent, only a single descent.
A RF variation can avoid overfitting despite similar decision boundaries.
Diversity and bias in ensembles significantly impact performance, with a broad optimal trade-off range.
Abstract
Random Forests (RFs) are among the state-of-the-art in machine learning and offer excellent performance with nearly zero parameter tuning. Remarkably, RFs seem to be impervious to overfitting even though their basic building blocks are well-known to overfit. Recently, a broadly received study argued that a RF exhibits a so-called double-descent curve: First, the model overfits the data in a u-shaped curve and then, once a certain model complexity is reached, it suddenly improves its performance again. In this paper, we challenge the notion that model capacity is the correct tool to explain the success of RF and argue that the algorithm which trains the model plays a more important role than previously thought. We show that a RF does not exhibit a double-descent curve but rather has a single descent. Hence, it does not overfit in the classic sense. We further present a RF variation that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Generative Adversarial Networks and Image Synthesis
