Binary Classification of Gaussian Mixtures: Abundance of Support Vectors, Benign Overfitting and Regularization
Ke Wang, Christos Thrampoulidis

TL;DR
This paper analyzes binary linear classification under Gaussian mixture models, revealing conditions where overparameterized interpolating classifiers achieve optimal performance and outperform regularized methods, highlighting the importance of SNR and data covariance.
Contribution
It provides new non-asymptotic bounds and conditions for optimality of interpolating classifiers in Gaussian mixture models, extending understanding of overparameterization benefits.
Findings
Interpolating classifiers can achieve asymptotic optimality under certain spectral and SNR conditions.
The results extend to noisy models with label flips, emphasizing SNR's role.
Interpolating estimators can outperform regularized methods under specific data conditions.
Abstract
Deep neural networks generalize well despite being exceedingly overparameterized and being trained without explicit regularization. This curious phenomenon has inspired extensive research activity in establishing its statistical principles: Under what conditions is it observed? How do these depend on the data and on the training algorithm? When does regularization benefit generalization? While such questions remain wide open for deep neural nets, recent works have attempted gaining insights by studying simpler, often linear, models. Our paper contributes to this growing line of work by examining binary linear classification under a generative Gaussian mixture model. Motivated by recent results on the implicit bias of gradient descent, we study both max-margin SVM classifiers (corresponding to logistic loss) and min-norm interpolating classifiers (corresponding to least-squares loss).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Machine Learning and Data Classification
MethodsSupport Vector Machine · Linear Regression
