The Generalization Error of Supervised Machine Learning Algorithms

Samir M. Perlaza; Xinying Zou

arXiv:2411.12030·cs.LG·January 1, 2026

The Generalization Error of Supervised Machine Learning Algorithms

Samir M. Perlaza, Xinying Zou

PDF

Open Access

TL;DR

This paper introduces the method of gaps, a novel approach to derive exact formulas for the generalization error of supervised learning algorithms using information measures and Gibbs probability measures.

Contribution

The paper presents a new method of gaps that provides closed-form expressions for generalization error, connecting it with information theory and Gibbs measures, and unifies existing results.

Findings

01

Provides exact formulas for generalization error using information measures.

02

Introduces the concept of algorithm-driven and data-driven gaps.

03

Establishes connections between generalization error and Gibbs probability measures.

Abstract

In this paper, the method of gaps, a technique for deriving closed-form expressions in terms of information measures for the generalization error of supervised machine learning algorithms is introduced. The method relies on the notion of \emph{gaps}, which characterize the variation of the expected empirical risk (when either the model or dataset is kept fixed) with respect to changes in the probability measure on the varying parameter (either the dataset or the model, respectively). This distinction results in two classes of gaps: Algorithm-driven gaps (fixed dataset) and data-driven gaps (fixed model). In general, the method relies on two central observations: $(i)$ ~The generalization error is the expectation of an algorithm-driven gap or a data-driven gap. In the first case, the expectation is with respect to a measure on the datasets; and in the second case, with respect to a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications