Error Scaling Laws for Kernel Classification under Source and Capacity   Conditions

Hugo Cui; Bruno Loureiro; Florent Krzakala; Lenka Zdeborov\'a

arXiv:2201.12655·stat.ML·September 7, 2023

Error Scaling Laws for Kernel Classification under Source and Capacity Conditions

Hugo Cui, Bruno Loureiro, Florent Krzakala, Lenka Zdeborov\'a

PDF

Open Access

TL;DR

This paper derives error decay rates for kernel classifiers under source and capacity conditions, providing accurate predictions of learning curves for real datasets and contrasting SVM and ridge classification.

Contribution

It introduces explicit error scaling laws for kernel classification under source and capacity conditions, validated on real data and two standard classifiers.

Findings

01

Derived decay rates for misclassification error as a function of source and capacity coefficients.

02

Rates accurately describe learning curves for real datasets.

03

Contrasted SVM and ridge classification, showing differences in error decay.

Abstract

We consider the problem of kernel classification. While worst-case bounds on the decay rate of the prediction error with the number of samples are known for some classifiers, they often fail to accurately describe the learning curves of real data sets. In this work, we consider the important class of data sets satisfying the standard source and capacity conditions, comprising a number of real data sets as we show numerically. Under the Gaussian design, we derive the decay rates for the misclassification (prediction) error as a function of the source and capacity coefficients. We do so for two standard kernel classification settings, namely margin-maximizing Support Vector Machines (SVM) and ridge classification, and contrast the two methods. We find that our rates tightly describe the learning curves for this class of data sets, and are also observed on real data. Our results can also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms