Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization
Benjamin Aubin, Florent Krzakala, Yue M. Lu, Lenka Zdeborov\'a

TL;DR
This paper derives a formula for the generalization error of convex regularized classifiers in high-dimensional settings, showing that logistic and hinge losses can nearly achieve Bayes-optimal performance, and proposes an optimal loss and regularizer.
Contribution
It provides a rigorous formula for generalization error in high dimensions, demonstrating near-optimality of common losses and designing an optimal loss and regularizer.
Findings
Logistic and hinge regression approach Bayes-optimal error as sample size increases.
Ridge regression performs poorly compared to logistic and hinge methods.
An optimal loss and regularizer are proposed that achieve Bayes-optimal error.
Abstract
We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the high-dimensional regime where is kept finite in the limit of a high dimension and number of samples . Our contribution is three-fold: First, we prove a formula for the generalization error achieved by regularized classifiers that minimize a convex loss. This formula was first obtained by the heuristic replica method of statistical physics. Secondly, focussing on commonly used loss functions and optimizing the regularization strength, we observe that while ridge regression performance is poor, logistic and hinge regression are surprisingly able to approach the Bayes-optimal generalization error extremely closely.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM
