Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization

Milad Sefidgaran; Kimia Nadjahi; Abdellatif Zaidi

arXiv:2510.23485·stat.ML·October 28, 2025

Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization

Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi

PDF

TL;DR

This paper introduces tighter CMI-based generalization bounds using stochastic projection and quantization, demonstrating improved guarantees and insights into the necessity of memorization for generalization in learning algorithms.

Contribution

It presents novel CMI bounds leveraging stochastic projection and quantization, improving over existing bounds and analyzing memorization's role in generalization.

Findings

01

New CMI bounds are tighter than existing ones.

02

Bounds achieve (1/\u221a{n}) guarantees for certain problems.

03

Memorization is not necessary for good generalization.

Abstract

In this paper, we leverage stochastic projection and lossy compression to establish new conditional mutual information (CMI) bounds on the generalization error of statistical learning algorithms. It is shown that these bounds are generally tighter than the existing ones. In particular, we prove that for certain problem instances for which existing MI and CMI bounds were recently shown in Attias et al. [2024] and Livni [2023] to become vacuous or fail to describe the right generalization behavior, our bounds yield suitable generalization guarantees of the order of $O (1/ n)$ , where $n$ is the size of the training dataset. Furthermore, we use our bounds to investigate the problem of data "memorization" raised in those works, and which asserts that there are learning problem instances for which any learning algorithm that has good prediction there exist distributions under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.