Sharp concentration of uniform generalization errors in binary linear classification

Shogo Nakakita

arXiv:2505.16713·stat.ML·June 27, 2025

Sharp concentration of uniform generalization errors in binary linear classification

Shogo Nakakita

PDF

Open Access

TL;DR

This paper investigates how uniformly the generalization errors in binary linear classification concentrate around their expected values, providing sharp bounds and broad asymptotic convergence results.

Contribution

It introduces new concentration bounds for generalization errors using isoperimetric inequalities and establishes uniform laws of large numbers in high-dimensional settings.

Findings

01

Concentration bounds are sharp up to constants for well-balanced labels.

02

Almost sure convergence of errors occurs in high-dimensional regimes.

03

Uniform laws of large numbers hold under dimension-free conditions.

Abstract

We examine the concentration of uniform generalization errors around their expectation in binary linear classification problems via an isoperimetric argument. In particular, we establish Poincar\'{e} and log-Sobolev inequalities for the joint distribution of the output labels and the label-weighted input vectors, which we apply to derive concentration bounds. The derived concentration bounds are sharp up to moderate multiplicative constants by those under well-balanced labels. In asymptotic analysis, we also show that almost sure convergence of uniform generalization errors to their expectation occurs in very broad settings, such as proportionally high-dimensional regimes. Using this convergence, we establish uniform laws of large numbers under dimension-free conditions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Computational Techniques in Science and Engineering · Face and Expression Recognition · Advanced Statistical Methods and Models