On Leave-One-Out Conditional Mutual Information For Generalization

Mohamad Rida Rammal; Alessandro Achille; Aditya Golatkar; Suhas; Diggavi; Stefano Soatto

arXiv:2207.00581·cs.LG·July 4, 2022·1 cites

On Leave-One-Out Conditional Mutual Information For Generalization

Mohamad Rida Rammal, Alessandro Achille, Aditya Golatkar, Suhas, Diggavi, Stefano Soatto

PDF

Open Access 1 Video

TL;DR

This paper introduces a new leave-one-out conditional mutual information measure to derive practical, interpretable generalization bounds for supervised learning, validated on deep learning image classification tasks.

Contribution

It proposes a novel loo-CMI measure that is easy to compute and connects to existing concepts like cross-validation and stability, improving generalization bounds.

Findings

01

Bounds are non-vacuous on large-scale image classification.

02

Loo-CMI bounds are computationally feasible and interpretable.

03

Empirical validation shows accurate prediction of generalization gap.

Abstract

We derive information theoretic generalization bounds for supervised learning algorithms based on a new measure of leave-one-out conditional mutual information (loo-CMI). Contrary to other CMI bounds, which are black-box bounds that do not exploit the structure of the problem and may be hard to evaluate in practice, our loo-CMI bounds can be computed easily and can be interpreted in connection to other notions such as classical leave-one-out cross-validation, stability of the optimization algorithm, and the geometry of the loss-landscape. It applies both to the output of training algorithms as well as their predictions. We empirically validate the quality of the bound by evaluating its predicted generalization gap in scenarios for deep learning. In particular, our bounds are non-vacuous on large-scale image-classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Leave-One-Out Conditional Mutual Information For Generalization· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Neural Networks and Applications