Tight bounds for maximum $\ell_1$-margin classifiers

Stefan Stojanovic; Konstantin Donhauser; Fanny Yang

arXiv:2212.03783·stat.ML·January 23, 2023

Tight bounds for maximum $\ell_1$-margin classifiers

Stefan Stojanovic, Konstantin Donhauser, Fanny Yang

PDF

Open Access

TL;DR

This paper establishes tight bounds on the prediction error of the maximum -margin classifier in high-dimensional settings, revealing its limitations and benign overfitting behavior.

Contribution

It provides the first rigorous analysis of the maximum -margin classifier's error bounds, showing they do not adapt to sparse ground truths and demonstrating benign overfitting in noisy scenarios.

Findings

01

Prediction error bounds match existing rates of /3 for sparse ground truths.

02

Error vanishes at a rate of 1/(/d/n) in noisy settings.

03

First demonstration of benign overfitting for maximum -margin classifiers.

Abstract

Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum $ℓ_{1}$ -margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separable. Previous works consistently show that many estimators relying on the $ℓ_{1}$ -norm achieve improved statistical rates for hard sparse ground truths. We show that surprisingly, this adaptivity does not apply to the maximum $ℓ_{1}$ -margin classifier for a standard discriminative setting. In particular, for the noiseless setting, we prove tight upper and lower bounds for the prediction error that match existing rates of order $\frac{∥ w ^{*} ∥ _{1}^{2/3}}{n ^{1/3}}$ for general ground truths. To complete the picture, we show that when interpolating noisy observations, the error vanishes at a rate of order $\frac{1}{l o g ( d / n )}$ . We are therefore first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques

MethodsSupport Vector Machine