On the Doubt about Margin Explanation of Boosting
Wei Gao, Zhi-Hua Zhou

TL;DR
This paper critically examines the margin theory behind AdaBoost's success, introduces a new $k$th margin bound, and improves existing empirical bounds, ultimately providing sharper generalization error bounds based on margin distribution.
Contribution
It introduces the $k$th margin bound, refines empirical Bernstein bounds, and offers a sharper generalization error bound considering margin distribution factors.
Findings
The $k$th margin bound relates to previous margin bounds.
Improved empirical Bernstein bounds enhance margin analysis.
New generalization bounds incorporate average margin and variance.
Abstract
Margin theory provides one of the most popular explanations to the success of \texttt{AdaBoost}, where the central point lies in the recognition that \textit{margin} is the key for characterizing the performance of \texttt{AdaBoost}. This theory has been very influential, e.g., it has been used to argue that \texttt{AdaBoost} usually does not overfit since it tends to enlarge the margin even after the training error reaches zero. Previously the \textit{minimum margin bound} was established for \texttt{AdaBoost}, however, \cite{Breiman1999} pointed out that maximizing the minimum margin does not necessarily lead to a better generalization. Later, \cite{Reyzin:Schapire2006} emphasized that the margin distribution rather than minimum margin is crucial to the performance of \texttt{AdaBoost}. In this paper, we first present the \textit{th margin bound} and further study on its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
