A Model of Double Descent for High-dimensional Binary Linear Classification
Zeyu Deng, Abla Kammoun, Christos Thrampoulidis

TL;DR
This paper models high-dimensional binary classification using logistic regression with gradient descent, revealing a phase transition in error behavior and demonstrating a double descent phenomenon in classification error as the overparameterization ratio varies.
Contribution
It provides a theoretical analysis of the implicit bias of gradient descent in high-dimensional logistic regression, identifying a phase transition and characterizing the double descent phenomenon.
Findings
Classification error exhibits a phase transition at a critical overparameterization ratio.
Double descent phenomenon observed in classification error as model complexity increases.
Theoretical predictions validated by numerical experiments.
Abstract
We consider a model for logistic regression where only a subset of features of size is used for training a linear classifier over training samples. The classifier is obtained by running gradient descent (GD) on logistic loss. For this model, we investigate the dependence of the classification error on the overparameterization ratio . First, building on known deterministic results on the implicit bias of GD, we uncover a phase-transition phenomenon for the case of Gaussian features: the classification error of GD is the same as that of the maximum-likelihood (ML) solution when , and that of the max-margin (SVM) solution when . Next, using the convex Gaussian min-max theorem (CGMT), we sharply characterize the performance of both the ML and the SVM solutions. Combining these results, we obtain curves that explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Machine Learning and Algorithms
MethodsLogistic Regression · Support Vector Machine · Linear Regression
