Generalization error in high-dimensional perceptrons: Approaching Bayes   error with convex optimization

Benjamin Aubin; Florent Krzakala; Yue M. Lu; Lenka Zdeborov\'a

arXiv:2006.06560·stat.ML·February 18, 2021·27 cites

Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

Benjamin Aubin, Florent Krzakala, Yue M. Lu, Lenka Zdeborov\'a

PDF

Open Access 1 Video

TL;DR

This paper derives a formula for the generalization error of convex regularized classifiers in high-dimensional settings, showing that logistic and hinge losses can nearly achieve Bayes-optimal performance, and proposes an optimal loss and regularizer.

Contribution

It provides a rigorous formula for generalization error in high dimensions, demonstrating near-optimality of common losses and designing an optimal loss and regularizer.

Findings

01

Logistic and hinge regression approach Bayes-optimal error as sample size increases.

02

Ridge regression performs poorly compared to logistic and hinge methods.

03

An optimal loss and regularizer are proposed that achieve Bayes-optimal error.

Abstract

We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the high-dimensional regime where $α = n / d$ is kept finite in the limit of a high dimension $d$ and number of samples $n$ . Our contribution is three-fold: First, we prove a formula for the generalization error achieved by $ℓ_{2}$ regularized classifiers that minimize a convex loss. This formula was first obtained by the heuristic replica method of statistical physics. Secondly, focussing on commonly used loss functions and optimizing the $ℓ_{2}$ regularization strength, we observe that while ridge regression performance is poor, logistic and hinge regression are surprisingly able to approach the Bayes-optimal generalization error extremely closely.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization· slideslive

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM