Tight bounds for maximum $\ell_1$-margin classifiers
Stefan Stojanovic, Konstantin Donhauser, Fanny Yang

TL;DR
This paper establishes tight bounds on the prediction error of the maximum -margin classifier in high-dimensional settings, revealing its limitations and benign overfitting behavior.
Contribution
It provides the first rigorous analysis of the maximum -margin classifier's error bounds, showing they do not adapt to sparse ground truths and demonstrating benign overfitting in noisy scenarios.
Findings
Prediction error bounds match existing rates of /3 for sparse ground truths.
Error vanishes at a rate of 1/(/d/n) in noisy settings.
First demonstration of benign overfitting for maximum -margin classifiers.
Abstract
Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum -margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separable. Previous works consistently show that many estimators relying on the -norm achieve improved statistical rates for hard sparse ground truths. We show that surprisingly, this adaptivity does not apply to the maximum -margin classifier for a standard discriminative setting. In particular, for the noiseless setting, we prove tight upper and lower bounds for the prediction error that match existing rates of order for general ground truths. To complete the picture, we show that when interpolating noisy observations, the error vanishes at a rate of order . We are therefore first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
MethodsSupport Vector Machine
