The Dynamics of AdaBoost Weights Tells You What's Hard to Classify
Bruno Caprile, Cesare Furlanello & Stefano Merler

TL;DR
This paper explores how the evolution of weights in AdaBoost reveals which data points are easy or hard to classify, providing insights into model construction and uncertainty regions.
Contribution
It introduces a novel analysis of AdaBoost weight dynamics to identify data point difficulty and uncertainty, enhancing understanding of model behavior.
Findings
Weight dynamics partition data into easy and hard classes.
Entropy measures quantify the relevance of hard points.
Methods improve sampling strategies within the Optimal Sampling framework.
Abstract
The dynamical evolution of weights in the Adaboost algorithm contains useful information about the role that the associated data points play in the built of the Adaboost model. In particular, the dynamics induces a bipartition of the data set into two (easy/hard) classes. Easy points are ininfluential in the making of the model, while the varying relevance of hard points can be gauged in terms of an entropy value associated to their evolution. Smooth approximations of entropy highlight regions where classification is most uncertain. Promising results are obtained when methods proposed are applied in the Optimal Sampling framework.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Neural Networks and Applications
