The Evolution of Boosting Algorithms - From Machine Learning to Statistical Modelling
Andreas Mayr, Harald Binder, Olaf Gefeller, Matthias Schmid

TL;DR
This paper reviews the evolution of boosting algorithms from their origins in machine learning to their application in statistical modelling, emphasizing methodological similarities and interpretability benefits.
Contribution
It provides a comprehensive overview of boosting algorithms, comparing AdaBoost, gradient boosting, and likelihood-based boosting, and discusses their software implementations and interpretability.
Findings
Statistical boosting offers more interpretable models than initial machine learning algorithms.
Gradient boosting and likelihood-based boosting share fundamental concepts despite different applications.
The paper includes practical R code examples for implementing boosting methods.
Abstract
The concept of boosting emerged from the field of machine learning. The basic idea is to boost the accuracy of a weak classifying tool by combining various instances into a more accurate prediction. This general concept was later adapted to the field of statistical modelling. This review article attempts to highlight this evolution of boosting algorithms from machine learning to statistical modelling. We describe the AdaBoost algorithm for classification as well as the two most prominent statistical boosting approaches, gradient boosting and likelihood-based boosting. Although both appraoches are typically treated separately in the literature, they share the same methodological roots and follow the same fundamental concepts. Compared to the initial machine learning algorithms, which must be seen as black-box prediction schemes, statistical boosting result in statistical models which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Statistical Methods and Models · Statistical Methods and Inference
